gaagagttga ttgagaagtg cctcttggtt aaggattaac cacagggaaa aatccagcag 60

aaacagaaga actgtgggtt tcttacccca gccctcaagg aagctatgcc gtgaaagggg 120 

tactgataca ctgacataca gcaagttgga cggggcatca gttcttcatt tgtggagtgg 180 

agaaaagaag aggaaatctc tcatttgggg catttgaagg atggcttccc tgtttcatca 240 

gcttcagatc ctggtctgga aaaattggct aggtgtaaaa aggcagccgc tttggacact 300 

tgtcttgatc ttatggccag tcattatttt cataattttg gctattactc ggaccaaatt 360 

tcctccaact gcaaaaccaa cttgttacct cgcacctcga aaccttccta gtactggatt 420 

ctttccattc ctgcagaccc tactctgtga cacagactct aaatgcaaag acacacccta 480 

tggcccacaa gatctgcttc gtaggaaagg aattgatgat gcactattta aagacagtga 540 

gattctgaga aagtcatcca acctggataa ggacagcagt ttatcattcc agagcaccca 600 

agttccagaa agaaggcatg catcactagc cacagtattt cccagtccaa gttctgattt 660

ggaaatcccc ggaacatata ctttcaatgg cagtcaagtg ctcgcacgaa ttcttggctt 720

ggaaaagctg ttaaagcaaa attcaacttc agaagatata cgaagagaac tatgtgacag 780

ctattcagga tacattgtgg atgatgcctt ctcttggacc tttctaggaa gaaatgtttt 840

taacaaattt tgcctttcta acatgaccct tttagagtct tctctccaag aactaaacaa 900

acagttctcc cagctatcca gtgaccccaa caatcagaag atagtgtttc aggaaatagt 960

cagaatgctg tctttcttct cacaagtgca agagcagaaa gctgtgtggc agcttctgtc 1020

tagttttcca aatgtgtttc agaatgacac atcactaagc aatctatttg atgttcttcg 1080

aaaggcaaac agtgtgctgc tggttgtgca gaaggtttat ccacgttttg caactaacga 1140

aggtttcaga accctccaga agtctgttaa acatctgctg tacactctgg actccccagc 1200

tcaaggtgac tccgataata taacgcatgt gtggaatgag gatgatggac agaccttatc 1260 

tccaagcagt ctggctgcac agctcctaat tctggaaaac tttgaagatg ccctcttaaa 1320 

tatatcagca aatagtcctt atattcctta cttggcatgt gtgagaaatg tgactgacag 1380 

tttggccaga ggttcaccag aaaatctaag actcctgcag tccacaatac gatttaaaaa 1440 

atcttttctt cgcaatggtt cctatgaaga ttactttcct ccagttcctg aagtcctaaa 1500 

atcaaaactg tctcaacttc gaaacttgac cgaacttctt tgtgaatctg aaactttcag 1560 

tttgatagag aagtcatgcc agctctctga tatgagcttt gggagcctgt gtgaagaaag 1620 

tgagtttgat ctgcaactcc tcgaagcggc agagctgggc accgaaatag cagccagctt 1680 

actgtaccat gacaatgtca tatctaaaaa agtgagagat ttgctgactg gagatccaag 1740 

caaaattaat ttaaatatgg atcagtttct agaacaggca ctgcaaatga attacttgga 1800 

aaatatcact cagttaatac cgatcataga agccatgctg catgtcaata acagtgcaga 1860 
tgcttctgaa aagccaggtc agttactaga aatgtttaaa aatgttgaag agctgaaaga 1920

agatttaagg agaacaacag gaatgtccaa caggactatt gacaagttgc tggccattcc 1980 

catccctgat aatagagctg agattatttc tcaggtgttc tggctgcatt cctgtgatac 2040 

taatatcacc actcccaaac tagaagatgc aatgaaagaa ttctgcaacc tgtctctttc 2100 

agagagatcc cggcagtctt acctcatcgg actcaccctt ctgcactact taaacattta 2160 

caacttcaca gacaaggtgt ttttcccgag gaaagatcaa aagccagtag aaaagatgat 2220 

ggagctcttc ataagactaa aagagattct caatcagatg gcttctggca cacatccgct 2280 

gctagacaaa atgagatccc tgaagcaaat gcatctgccc agaagtgttc cattaacaca 2340 

ggcaatgtac agaagcaacc gaatgaacac accacaagga tcatttagca ccatctccca 2400 

agcattatgt tctcaaggaa ttaccactga atatttaact gccatgctgc cctcttccca 2460 

gaggccaaaa ggcaaccaca ccaaggattt tttgacttat aaattaacta aagagcaaat 2520 

tgcttcaaaa tatggaattc ccataaatac cacaccattt tgcttctccc tttataaaga 2580

catcattaac atgcccgctg gacctgtgat ttgggctttc ttgaaaccta tgttgttggg 2640

aagaattttg catgcaccat ataacccagt cacaaaggca ataatggaaa agtccaatgt 2700

aactctgaga cagctggcgg aattaagaga aaaatctcaa gagtggatgg ataagtcgcc 2760

acttttcatg aattccttcc atctgttaaa ccaggcaatt ccaatgctcc agaatactct 2820

aaggaaccct tttgtgcaag tttttgtaaa gttctccgtg ggactcgatg ctgttgaact 2880

attgaaacag atagatgaac tcgatattct aagactgaaa ttagagaaca acattgacat 2940

catcgatcag cttaacacac tatcttccct gacagtaaat atttcctctt gtgtattata 3000

tgaccgtatt caggcagcaa aaaccataga tgaaatggag agagaggcta aaaggctcta 3060

caaaagcaac gaactctttg gaagtgttat ttttaagctt ccttctaaca gaagctggca 3120

cagaggctat gactctggaa atgtctttct tcctcctgtc ataaaatata ccatccggat 3180 

gagtctcaag accgcacaga ccacaagaag cctaagaacc aagatttggg ctccagggcc 3240 

acacaattct ccatcacaca accagatcta tggcagggct tttatttatt tacaggatag 3300 

tattgaaaga gcaatcattg aattgcaaac tggaaggaac tcccaggaaa tagcagtcca 3360 

ggttcaagca attccttatc cctgcttcat gaaagacaac ttcctaacca gtgtctctta 3420 

ttctcttcca attgtgctta tggttgcctg ggttgtattt atagctgcct ttgtaaaaaa 3480 

gcttgtctat gagaaagacc tccggcttca tgagtacatg aagatgatgg gtgtgaactc 3540 

ctgcagccat ttctttgcct ggcttataga gagtgttgga tttttactgg ttaccatcgt 3600 

gatcctcatc attatactca agtttggcaa tattcttcct aaaacaaatg ggttcatttt 3660 

gttcctgtat ttttcggact acagcttctc ggttattgcc atgagctatc ttatcagtgt 3720 

cttcttcaac aacaccaaca ttgcagctct gatcggaagc ctcatctaca tcattgcctt 3780 
ctttccattt attgttctgg ttacagtgga gaatgagttg agctatgtat tgaaagtgtt 3840



catgagcctg ctgtccccaa cagcattcag ctatgcaagc caatacattg cacgatacga 3900 

agaacagggc attggtcttc agtgggaaaa tatgtacacc tccccggttc aggatgacac 3960 

cacctcattt ggctggctgt gctgtctaat cctagctgac tctttcattt atttccttat 4020 

tgcttggtat gtcaggaatg tcttcccagg gacatacggt atggcagctc cctggtattt 4080 

tccaattctt ccttcctatt ggaaggagcg atttgggtgt gcagaggtga agcctgagaa 4140 

gagcaatggc ctcatgttta ctaacatcat gatgcagaac accaacccat ctgccagtcc 4200 

tgaatacatg ttttcctcta acatcgagcc tgaacctaaa gatctcacag tcggggttgc 4260 

cctgcatggg gtcacaaaga tctatggctc aaaagttgct gttgataacc tcaatctgaa 4320 

cttttatgaa gggcatatta cttcattgct ggggcccaat ggagctggga aaactactac 4380 

catttccatg ttaactgggc tgtttggggc ctcagcaggc accatttttg tatatggaaa 4440 

agatatcaaa acagacctac acacggtacg gaagaacatg ggagtctgta tgcagcacga 4500

cgtcttgttc agttacctca ctactaagga gcaccttctc ctatatggtt ccatcaaagt 4560

tcctcactgg actaaaaagc agctccacga ggaagtaaaa aggactttaa aagatactgg 4620

actatatagc catcgtcata agagagttgg aacactgtca ggaggcatga agaggaagtt 4680

atctatatcc atagctctca ttggtggatc aagggtagta attttggatg aaccatctac 4740

tggagttgac ccatgttctc gccgaagtat atgggatgtt atatccaaga acaaaactgc 4800

cagaacaatc attctgtcaa cgcaccactt ggacgaggct gaagtgctga gtgaccgcat 4860

cgccttcctg gagcagggtg ggcttaggtg ctgtgggtcc ccattttacc tcaaggaagc 4920

ctttggcgat gggtatcacc tcacgcttac caagaagaag agtccaaatt taaatgcaaa 4980

tgcagtatgt gacaccatgg ccgtgacagc aatgatccaa tcacatctcc ccgaagccta 5040

cctcaaggag gatattgggg gagagcttgt ttatgtactt cctccattca gcaccaaagt 5100 

ctcaggggcc tacctgtcac tcctacgggc actcgacaat ggcatgggtg acctcaacat 5160 

cgggtgctac ggcatttcag ataccaccgt ggaggaggtc tttctgaact tgaccaaaga 5220 

gtcacaaaaa aatagtgcta tgagtcttga gcacttaaca caaaagaaaa ttgggaattc 5280 

caatgccaat ggcatctcaa ctcctgacga tttatctgtg agcagcagca atttcacaga 5340 

cagagatgac aaaatcctga caagaggaga gaggctggat ggctttggac tgttgctgaa 5400 

gaagatcatg gctatactca tcaagaggtt ccaccacacc cgcaggaact ggaaaggtct 5460 

cattgctcag gttatcctcc ccatcgtctt tgttaccact gccatgggcc ttggcacact 5520 

gagaaattcc agcaacagtt atccagagat tcagatctcc ccctctcttt atggtacctc 5580 

cgaacagaca gccttctatg ctaattatca cccgagcacg gaagcacttg tctcagcaat 5640 

gtgggacttc cctggaattg acaacatgtg tctgaacacc agtgatctac agtgtttaaa 5700 
caaagacagt ctggaaaaat ggaacaccag tggagaaccc atcactaatt ttggtgtttg 5760

ctcctgctca gaaaatgtcc aggaatgtcc taaatttaac tattccccac cgcacagaag 5820

aacttactca tcccaggtaa tttataacct cactgggcaa cgagtggaaa attatcttat 5880

atcaactgca aatgagtttg tccaaaaaag atatggaggt tggagttttg ggctgccttt 5940

gacaaaagac cttcgttttg atataacagg agtccctgcc aatagaacac ttgccaaggt 6000

atggtatgat ccagaaggct atcactccct tccagcttac ctcaacagcc tgaataattt 6060

ccttctgcga gttaacatgt caaaatacga tgctgcccga catggcatca tcatgtatag 6120

ccatccttat ccaggagtgc aagaccaaga acaagccaca atcagcagtt taatcgatat 6180

tttagtggca ctgtctatct tgatgggcta ctctgtcacc accgccagct ttgtcaccta 6240

tgttgtaagg gaacatcaaa ccaaagccaa acagttgcag cacatttcag gcattggcgt 6300

gacatgctac tgggtaacaa acttcattta tgacatggtt ttctacttgg tgcctgtagc 6360

gttttcaatt ggtatcattg cgattttcaa attacctgca ttctacagtg aaaacaacct 6420

aggcgctgta tctctcctac ttctcctgtt tgggcatgca acattttcct ggatgtactt 6480

gctggctggg ctcttccatg aaacaggaat ggccttcatc acttacgtct gtgtcaactt 6540

gttttttggc attaattcca ttgtttccct gtcagtggta tactttcttt ccaaggaaaa 6600

gcctaatgat ccgactttag aacttatttc tgaaaccctc aagcgcattt tcctgatttt 6660

cccacaattc tgttttggct acggtttgat tgaactttct caacaacagt cggtcctaga 6720

cttcttaaaa gcatatggag tggaataccc aaatgaaacc tttgagatga ataaactagg 6780

tgcaatgttt gtggctttgg tttctcaggg caccatgttt ttttccttgc gactcttaat 6840

caacgaatcc ctgataaaga aactcaggct tttcttcaga aaatttaatt cttcacatgt 6900

aagggagaca atagatgagg atgaagatgt gcgggctgag agattaagag ttgagagtgg 6960

tgcagctgaa tttgacttgg tccaacttta ttgtctcaca aagacctacc aacttatcca 7020 

caaaaagatt atagctgtaa acaacatcag catcgggata cctgctggag agtgttttgg 7080 

gcttcttgga gtgaatggag caggaaagac cactatattc aagatgctga caggagacat 7140 

cattccttca agtggaaaca ttctgatcag aaataagacc ggatctctgg gtcacgttga 7200 

ttctcacagc tcattagttg gctactgtcc tcaggaagat gccttagatg acctggtaac 7260 

tgtggaagaa catttgtatt tctatgccag ggtacatgga attccagaaa aggatattaa 7320 

agaaactgtt cataaactcc ttaggagact tcacctgatg cccttcaagg acagagctac 7380 

ctctatgtgc agttatggca caaaaagaaa attatccact gcactggcct tgatagggaa 7440 

accttccatt ctactgctgg atgagccgag ctctggcatg gatccgaagt cgaaacggca 7500 

cctctggaag atcatttcag aagaagtaca gaacaaatgt tccgtcatcc tcacatctca 7560 

cagcatggaa gaatgtgaag ctctctgtac caggttggcc attatggtga atggaaagtt 7620 
tcaatgtatt ggatctttgc agcacataaa gagcaggttt ggacgaggat ttactgtcaa 7680

agttcacttg aagaataaca aagtgaccat ggagaccctc acaaagttca tgcagctgca 7740 

ctttccaaaa acatacttaa aagatcagca cctcagcatg ctagagtatc atgtaccagt 7800 

cacagcagga ggagtcgcaa acatttttga tctgctggaa accaacaaga ctgctttaaa 7860 

tattacaaat ttcttagtga gtcagaccac tctggaagag gttttcatca actttgccaa 7920 

agaccagaag tcctatgaaa ctgctgatac cagcagccaa ggttccacta taagtgttga 7980 

ctcacaagat gaccagatgg agtcttaaca cttccagcaa actcaatctc agcgtgtgac 8040 

caatggcttc attttgaaga aaagccacag aagatacact tccgcaagat atcttcattt 8100 

taaagtaaag taatatactg tatggaaagt tacaactgtg ttagactaac aagtaattat 8160 

aaaaggaaat ttttccttct aaggtcagtg agtgttgttg ctactgaaat gaattcctgt 8220 

atactcaaca ctgtgagcat gctaatgtat atgctggtga ttcttatgca aaggtgaagc 8280 

cacctcaaga tgaatatctt aatttattac tttcaataaa aagacagttt aaaaggcatg 8340 

gattttggta gttgaaatat aagagtggag aagaaaagtc agatggtttg tggcaggtgc 8400

caccgggcaa gcagacaaca taatttattt ccagaaaaca acagaatgaa catcatcatg 8460

aatacatgaa tcggctgtga tgtgtgaact gctaagggcc aaatgaacgt ttgnagagca 8520

gtgggcacaa tgtttacaat gtatgngtat gtcactttcg gtaccngtga atgcatgggg 8580

acgtgctgaa cccgaaaaaa agtgcctttc cataaggact gcaatagaga gggcaattta 8640

ccctggtggt acacggaacc tagattcact cctgccatnc cttgccaata gtaagctgca 8700

gggtggaaca agaaatcact tgctctgggg ggaagggagg ggggaatggg tgtgtcagct 8760

gggtagatac aaaccctgaa aagagaatcc atgtgctnct ggcaggcaac attttttaaa 8820 

gctctttcag aaaccctcat atttggggtt tcttttcagg aaacattcct gtggagggaa 8880 

aacgaatatg aagataattt tcagctaatt atctgggtga cccagaatcg tgtatatggc 8940 

tataggatag acttcttaat aatggcaagt gacgtggccc tggggaaagg tgctttatgt 9000 

accgtgtgtg cgtgtatgtg tgtgtatcta tacaagtttg tcagctttgg catgactgtt 9060 

tgtctcgaaa accaataaac tcaaagttta gaaaaactca aaaaaaaaaa aa 9112 

<210> 2 
<211> 8875 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> n 



<222> (1)..(8875)
<223> n = a, t, g, or c 



gaagagttga ttgagaagtg cctcttggtt aaggattaac cacagggaaa aatccagcag 60 

aaacagaaga actgtgggtt tcttacccca gccctcaagg aagctatgcc gtgaaagggg 120 

tactgataca ctgacataca gcaagttgga cggggcatca gttcttcatt tgtggagtgg 180 

agaaaagaag aggaaatctc tcatttgggg catttgaagg atggcttccc tgtttcatca 240 

gcttcagatc ctggtctgga aaaattggct aggtgtaaaa aggcagccgc tttggacact 300 

tgtcttgatc ttatggccag tcattatttt cataattttg gctattactc ggaccaaatt 360 

tcctccaact gcaaaaccaa cttgttacct cgcacctcga aaccttccta gtactggatt 420 

ctttccattc ctgcagaccc tactctgtga cacagactct aaatgcaaag acacacccta 480 

tggcccacaa gatctgcttc gtaggaaagg aattgatgat gcactattta aagacagtga 540 

gattctgaga aagtcatcca acctggataa ggacagcagt ttatcattcc agagcaccca 600 

agttccagaa agaaggcatg catcactagc cacagtattt cccagtccaa gttctgattt 660 

ggaaatcccc ggaacatata ctttcaatgg cagtcaagtg ctcgcacgaa ttcttggctt 720 

ggaaaagctg ttaaagcaaa attcaacttc agaagatata cgaagagaac tatgtgacag 780 

ctattcagga tacattgtgg atgatgcctt ctcttggacc tttctaggaa gaaatgtttt 840 

taacaaattt tgcctttcta acatgaccct tttagagtct tctctccaag aactaaacaa 900 

acagttctcc cagctatcca gtgaccccaa caatcagaag atagtgtttc aggaaatagt 960 

cagaatgctg tctttcttct cacaagtgca agagcagaaa gctgtgtggc agcttctgtc 1020 

tagttttcca aatgtgtttc agaatgacac atcactaagc aatctatttg atgttcttcg 1080 

aaaggcaaac agtgtgctgc tggttgtgca gaaggtttat ccacgttttg caactaacga 1140 

aggtttcaga accctccaga agtctgttaa acatctgctg tacactctgg actccccagc 1200 

tcaaggtgac tccgataata taacgcatgt gtggaatgag gatgatggac agaccttatc 1260 

tccaagcagt ctggctgcac agctcctaat tctggaaaac tttgaagatg ccctcttaaa 1320 

tatatcagca aatagtcctt atattcctta cttggcatgt gtgagaaatg tgactgacag 1380 

tttggccaga ggttcaccag aaaatctaag actcctgcag tccacaatac gatttaaaaa 1440 

atcttttctt cgcaatggtt cctatgaaga ttactttcct ccagttcctg aagtcctaaa 1500 

atcaaaactg tctcaacttc gaaacttgac cgaacttctt tgtgaatctg aaactttcag 1560 

tttgatagag aagtcatgcc agctctctga tatgagcttt gggagcctgt gtgaagaaag 1620 

tgagtttgat ctgcaactcc tcgaagcggc agagctgggc accgaaatag cagccagctt 1680 
actgtaccat gacaatgtca tatctaaaaa agtgagagat ttgctgactg gagatccaag 1740

caaaattaat ttaaatatgg atcagtttct agaacaggca ctgcaaatga attacttgga 1800 

aaatatcact cagttaatac cgatcataga agccatgctg catgtcaata acagtgcaga 1860 

tgcttctgaa aagccaggtc agttactaga aatgtttaaa aatgttgaag agctgaaaga 1920 

agatttaagg agaacaacag gaatgtccaa caggactatt gacaagttgc tggccattcc 1980 

catccctgat aatagagctg agattatttc tcaggtgttc tggctgcatt cctgtgatac 2040 

taatatcacc actcccaaac tagaagatgc aatgaaagaa ttctgcaacc tgtctctttc 2100 

agagagatcc cggcagtctt acctcatcgg actcaccctt ctgcactact taaacattta 2160 

caacttcaca gacaaggtgt ttttcccgag gaaagatcaa aagccagtag aaaagatgat 2220 

ggagctcttc ataagactaa aagagattct caatcagatg gcttctggca cacatccgct 2280 

gctagacaaa atgagatccc tgaagcaaat gcatctgccc agaagtgttc cattaacaca 2340

ggcaatgtac agaagcaacc gaatgaacac accacaagga tcatttagca ccatctccca 2400

agcattatgt tctcaaggaa ttaccactga atatttaact gccatgctgc cctcttccca 2460

gaggccaaaa ggcaaccaca ccaaggattt tttgacttat aaattaacta aagagcaaat 2520

tgcttcaaaa tatggaattc ccataaatac cacaccattt tgcttctccc tttataaaga 2580

catcattaac atgcccgctg gacctgtgat ttgggctttc ttgaaaccta tgttgttggg 2640

aagaattttg catgcaccat ataacccagt cacaaaggca ataatggaaa agtccaatgt 2700

aactctgaga cagctggcgg aattaagaga aaaatctcaa gagtggatgg ataagtcgcc 2760

acttttcatg aattccttcc atctgttaaa ccaggcaatt ccaatgctcc agaatactct 2820

aaggaaccct tttgtgcaag tttttgtaaa gttctccgtg ggactcgatg ctgttgaact 2880

attgaaacag atagatgaac tcgatattct aagactgaaa ttagagaaca acattgacat 2940

catcgatcag cttaacacac tatcttccct gacagtaaat atttcctctt gtgtattata 3000 

tgaccgtatt caggcagcaa aaaccataga tgaaatggag agagaggcta aaaggctcta 3060 

caaaagcaac gaactctttg gaagtgttat ttttaagctt ccttctaaca gaagctggca 3120 

cagaggctat gactctggaa atgtctttct tcctcctgtc ataaaatata ccatccggat 3180 

gagtctcaag accgcacaga ccacaagaag cctaagaacc aagatttggg ctccagggcc 3240 

acacaattct ccatcacaca accagatcta tggcagggct tttatttatt tacaggatag 3300 

tattgaaaga gcaatcattg aattgcaaac tggaaggaac tcccaggaaa tagcagtcca 3360 

ggttcaagca attccttatc cctgcttcat gaaagacaac ttcctaacca gtgtctctta 3420 

ttctcttcca attgtgctta tggttgcctg ggttgtattt atagctgcct ttgtaaaaaa 3480 

gcttgtctat gagaaagacc tccggcttca tgagtacatg aagatgatgg gtgtgaactc 3540 

ctgcagccat ttctttgcct ggcttataga gagtgttgga tttttactgg ttaccatcgt 3600 
gatcctcatc attatactca agtttggcaa tattcttcct aaaacaaatg ggttcatttt 3660

gttcctgtat ttttcggact acagcttctc ggttattgcc atgagctatc ttatcagtgt 3720 

cttcttcaac aacaccaaca ttgcagctct gatcggaagc ctcatctaca tcattgcctt 3780 

ctttccattt attgttctgg ttacagtgga gaatgagttg agctatgtat tgaaagtgtt 3840 

catgagcctg ctgtccccaa cagcattcag ctatgcaagc caatacattg cacgatacga 3900 

agaacagggc attggtcttc agtgggaaaa tatgtacacc tccccggttc aggatgacac 3960 

cacctcattt ggctggctgt gctgtctaat cctagctgac tctttcattt atttccttat 4020 

tgcttggtat gtcaggaatg tcttcccagg gacatacggt atggcagctc cctggtattt 4080 

tccaattctt ccttcctatt ggaaggagcg atttgggtgt gcagaggtga agcctgagaa 4140 

gagcaatggc ctcatgttta ctaacatcat gatgcagaac accaacccat ctgccagtcc 4200 

tgaatacatg ttttcctcta acatcgagcc tgaacctaaa gatctcacag tcggggttgc 4260 

cctgcatggg gtcacaaaga tctatggctc aaaagttgct gttgataacc tcaatctgaa 4320 

cttttatgaa gggcatatta cttcattgct ggggcccaat ggagctggga aaactactac 4380

catttccatg ttaactgggc tgtttggggc ctcagcaggc accatttttg tatatggaaa 4440

agatatcaaa acagacctac acacggtacg gaagaacatg ggagtctgta tgcagcacga 4500

cgtcttgttc agttacctca ctactaagga gcaccttctc ctatatggtt ccatcaaagt 4560

tcctcactgg actaaaaagc agctccacga ggaagtaaaa aggactttaa aagatactgg 4620

actatatagc catcgtcata agagagttgg aacactgtca ggaggcatga agaggaagtt 4680

atctatatcc atagctctca ttggtggatc aagggtagta attttggatg aaccatctac 4740

tggagttgac ccatgttctc gccgaagtat atgggatgtt atatccaaga acaaaactgc 4800

cagaacaatc attctgtcaa cgcaccactt ggacgaggct gaagtgctga gtgaccgcat 4860

cgccttcctg gagcagggtg ggcttaggtg ctgtgggtcc ccattttacc tcaaggaagc 4920 

ctttggcgat gggtatcacc tcacgcttac caagaagaag gtctttctga acttgaccaa 4980 

agagtcacaa aaaaatagtg ctatgagtct tgagcactta acacaaaaga aaattgggaa 5040 

ttccaatgcc aatggcatct caactcctga cgatttatct gtgagcagca gcaatttcac 5100 

agacagagat gacaaaatcc tgacaagagg agagaggctg gatggctttg gactgttgct 5160 

gaagaagatc atggctatac tcatcaagag gttccaccac gcccgcagga actggaaagg 5220 

tctcattgct caggttatcc tccccatcgt ctttgttacc actgccatgg gccttggcac 5280 

actgagaaat tccagcaaca gttatccaga gattcagatc tccccctctc tttatggtac 5340 

ctccgnacag acagccttct atgctaatta tcacccgagc acggaagcac ttgtctcagc 5400 

aatgtgggac ttccctggaa ttgacaacat gtgtctgaac accagtgatc tacagtgttt 5460 

aaacaaagac agtctggaaa aatggaacac cagtggagaa cccatcacta attttggtgt 5520 
ttgctcctgc tcagaaaatg tccaggaatg tcctaaattt aactattccc caccgcacag 5580

aagaacttac tcatcccagg taatttataa cctcactggg caacgagtgg aaaattatct 5640 

tatatcaact gcaaatgagt ttgtccaaaa aagatatgga ggttggagtt ttgggctgcc 5700 

tttgacaaaa gaccttcgtt ttgatataac aggagtccct gccaatagaa cacttgccaa 5760 

ggtatggtat gatccagaag gctatcactc ccttccagct tacctcaaca gcctgaataa 5820 

tttccttctg cgagttaaca tgtcaaaata cgatgctgcc cgacatggca tcatcatgta 5880 

tagccatcct tatccaggag tgcaagacca agaacaagcc acaatcagca gtttaatcga 5940 

tattttagtg gcactgtcta tcttgatggg ctactctgtc accaccgcca gctttgtcac 6000 

ctatgttgta agggaacatc aaaccaaagc caaacagttg cagcacattt caggcattgg 6060 

cgtgacatgc tactgggtaa caaacttcat ttatgacatg gttttctact tggtgcctgt 6120 

agcgttttca attggtatca ttgcgatttt caaattacct gcattctaca gtgaaaacaa 6180 

cctaggcgct gtatctctcc tacttctcct gtttgggcat gcaacatttt cctggatgta 6240

cttgctggct gggctcttcc atgaaacagg aatggccttc atcacttacg tctgtgtcaa 6300

cttgtttttt ggcattaatt ccattgtttc cctgtcagtg gtatactttc tttccaagga 6360

aaagcctaat gatccgactt tagaacttat ttctgaaacc ctcaagcgca ttttcctgat 6420

tttcccacaa ttctgttttg gctacggttt gattgaactt tctcaacaac agtcggtcct 6480

agacttctta aaagcatatg gagtggaata cccaaatgaa acctttgaga tgaataaact 6540

aggtgcaatg tttgtggctt tggtttctca gggcaccatg tttttttcct tgcgactctt 6600

aatcaacgaa tccctgataa agaaactcag gcttttcttc agaaaattta attcttcaca 6660

tgtaagggag acaatagatg aggatgaaga tgtgcgggct gagagattaa gagttgagag 6720

tggtgcagct gaatttgact tggtccaact ttattgtctc acaaagacct accaacttat 6780

ccacaaaaag attatagctg taaacaacat cagcatcggg atacctgctg gagagtgttt 6840 

tgggcttctt ggagtgaatg gagcaggaaa gaccactata ttcaagatgc tgacaggaga 6900 

catcattcct tcaagtggaa acattctgat cagaaataag accggatctc tgggtcacgt 6960 

tgattctcac agctcattag ttggctactg tcctcaggaa gatgccttag atgacctggt 7020 

aactgtggaa gaacatttgt atttctatgc cagggtacat ggaattccag aaaaggatat 7080 

taaagaaact gttcataaac tccttaggag acttcacctg atgcccttca aggacagagc 7140 

tacctctatg tgcagttatg gcacaaaaag aaaattatcc actgcactgg ccttgatagg 7200 

gaaaccttcc attctactgc tggatgagcc gagctctggc atggatccga agtcgaaacg 7260 

gcacctctgg aagatcattt cagaagaagt acagaacaaa tgttccgtca tcctcacatc 7320 

tcacagcatg gaagaatgtg aagctctctg taccaggttg gccattatgg tgaatggaaa 7380 

gtttcaatgt attggatctt tgcagcacat aaagagcagg tttggacgag gatttactgt 7440 
caaagttcac ttgaagaata acaaagtgac catggagacc ctcacaaagt tcatgcagct 7500

gcactttcca aaaacatact taaaagatca gcacctcagc atgctagagt atcatgtacc 7560 

agtcacagca ggaggagtcg caaacatttt tgatctgctg gaaaccaaca agactgcttt 7620 

aaatattaca aatttcttag tgagtcagac cactctggaa gaggttttca tcaactttgc 7680 

caaagaccag aagtcctatg aaactgctga taccagcagc caaggttcca ctataagtgt 7740 

tgactcacaa gatgaccaga tggagtctta acacttccag caaactcaat ctcagcgtgt 7800 

gaccaatggc ttcattttga agaaaagcca cagaagatac acttccgcaa gatatcttca 7860 

ttttaaagta aagtaatata ctgtatggaa agttacaact gtgttagact aacaagtaat 7920 

tataaaagga aatttttcct tctaaggtca gtgagtgttg ttgctactga aatgaattcc 7980 

tgtatactca acactgtgag catgctaatg tatatgctgg tgattcttat gcaaaggtga 8040 

agccacctca agatgaatat cttaatttat tactttcaat aaaaagacag tttaaaaggc 8100 

atggattttg gtagttgaaa tataagagtg gagaagaaaa gtcagatggt ttgtggcagg 8160

tgccaccggg caagcagaca acataattta tttccagaaa acaacagaat gaacatcatc 8220

atgaatacat gaatcggctg tgatgtgtga actgctaagg gccaaatgaa cgtttgnaga 8280

gcagtgggca caatgtttac aatgtatgng tatgtcactt tcggtaccng tgaatgcatg 8340

gggacgtgct gaacccgaaa aaaagtgcct ttccataagg actgcaatag agagggcaat 8400

ttaccctggt ggtacacgga acctagattc actcctgcca tnccttgcca atagtaagct 8460

gcagggtgga acaagaaatc acttgctctg gggggaaggg aggggggaat gggtgtgtca 8520

gctgggtaga tacaaaccct gaaaagagaa tccatgtgct nctggcaggc aacatttttt 8580

aaagctcttt cagaaaccct catatttggg gtttcttttc aggaaacatt cctgtggagg 8640

gaaaacgaat atgaagataa ttttcagcta attatctggg tgacccagaa tcgtgtatat 8700

ggctatagga tagacttctt aataatggca agtgacgtgg ccctggggaa aggtgcttta 8760 

tgtaccgtgt gtgcgtgtat gtgtgtgtat ctatacaagt ttgtcagctt tggcatgact 8820 

gtttgtctcg aaaaccaata aactcaaagt ttagaaaaac tcaaaaaaaa aaaaa 8875 

<210> 3 

<211> 8350 

<212> DNA 

<213> Homo sapiens 

<400> 3 

gaagagttga ttgagaagtg cctcttggtt aaggattaac cacagggaaa aatccagcag 60 

aaacagaaga actgtgggtt tcttacccca gccctcaagg aagctatgcc gtgaaagggg 120 
tactgataca ctgacataca gcaagttgga cggggcatca gttcttcatt tgtggagtgg 180

agaaaagaag aggaaatctc tcatttgggg catttgaagg atggcttccc tgtttcatca 240 

gcttcagatc ctggtctgga aaaattggct aggtgtaaaa aggcagccgc tttggacact 300 

tgtcttgatc ttatggccag tcattatttt cataattttg gctattactc ggaccaaatt 360 

tcctccaact gcaaaaccaa cttgttacct cgcacctcga aaccttccta gtactggatt 420 

ctttccattc ctgcagaccc tactctgtga cacagactct aaatgcaaag acacacccta 480 

tggcccacaa gatctgcttc gtaggaaagg aattgatgat gcactattta aagacagtga 540 

gattctgaga aagtcatcca acctggataa ggacagcagt ttatcattcc agagcaccca 600 

agttccagaa agaaggcatg catcactagc cacagtattt cccagtccaa gttctgattt 660 

ggaaatcccc ggaacatata ctttcaatgg cagtcaagtg ctcgcacgaa ttcttggctt 720 

ggaaaagctg ttaaagcaaa attcaacttc agaagatata cgaagagaac tatgtgacag 780 

ctattcagga tacattgtgg atgatgcctt ctcttggacc tttctaggaa gaaatgtttt 840

taacaaattt tgcctttcta acatgaccct tttagagtct tctctccaag aactaaacaa 900

acagttctcc cagctatcca gtgaccccaa caatcagaag atagtgtttc aggaaatagt 960

cagaatgctg tctttcttct cacaagtgca agagcagaaa gctgtgtggc agcttctgtc 1020

tagttttcca aatgtgtttc agaatgacac atcactaagc aatctatttg atgttcttcg 1080

aaaggcaaac agtgtgctgc tggttgtgca gaaggtttat ccacgttttg caactaacga 1140

aggtttcaga accctccaga agtctgttaa acatctgctg tacactctgg actccccagc 1200

tcaaggtgac tccgataata taacgcatgt gtggaatgag gatgatggac agaccttatc 1260

tccaagcagt ctggctgcac agctcctaat tctggaaaac tttgaagatg ccctcttaaa 1320

tatatcagca aatagtcctt atattcctta cttggcatgt gtgagaaatg tgactgacag 1380

tttggccaga ggttcaccag aaaatctaag actcctgcag tccacaatac gatttaaaaa 1440 

atcttttctt cgcaatggtt cctatgaaga ttactttcct ccagttcctg aagtcctaaa 1500 

atcaaaactg tctcaacttc gaaacttgac cgaacttctt tgtgaatctg aaactttcag 1560 

tttgatagag aagtcatgcc agctctctga tatgagcttt gggagcctgt gtgaagaaag 1620 

tgagtttgat ctgcaactcc tcgaagcggc agagctgggc accgaaatag cagccagctt 1680 

actgtaccat gacaatgtca tatctaaaaa agtgagagat ttgctgactg gagatccaag 1740 

caaaattaat ttaaatatgg atcagtttct agaacaggca ctgcaaatga attacttgga 1800 

aaatatcact cagttaatac cgatcataga agccatgctg catgtcaata acagtgcaga 1860 

tgcttctgaa aagccaggtc agttactaga aatgtttaaa aatgttgaag agctgaaaga 1920 

agatttaagg agaacaacag gaatgtccaa caggactatt gacaagttgc tggccattcc 1980 

catccctgat aatagagctg agattatttc tcaggtgttc tggctgcatt cctgtgatac 2040 
taatatcacc actcccaaac tagaagatgc aatgaaagaa ttctgcaacc tgtctctttc 2100

agagagatcc cggcagtctt acctcatcgg actcaccctt ctgcactact taaacattta 2160 

caacttcaca gacaaggtgt ttttcccgag gaaagatcaa aagccagtag aaaagatgat 2220 

ggagctcttc ataagactaa aagagattct caatcagatg gcttctggca cacatccgct 2280 

gctagacaaa atgagatccc tgaagcaaat gcatctgccc agaagtgttc cattaacaca 2340 

ggcaatgtac agaagcaacc gaatgaacac accacaagga tcatttagca ccatctccca 2400 

agcattatgt tctcaaggaa ttaccactga atatttaact gccatgctgc cctcttccca 2460 

gaggccaaaa ggcaaccaca ccaaggattt tttgacttat aaattaacta aagagcaaat 2520 

tgcttcaaaa tatggaattc ccataaatac cacaccattt tgcttctccc tttataaaga 2580 

catcattaac atgcccgctg gacctgtgat ttgggctttc ttgaaaccta tgttgttggg 2640 

aagaattttg catgcaccat ataacccagt cacaaaggca ataatggaaa agtccaatgt 2700 

aactctgaga cagctggcgg aattaagaga aaaatctcaa gagtggatgg ataagtcgcc 2760 

acttttcatg aattccttcc atctgttaaa ccaggcaatt ccaatgctcc agaatactct 2820 

aaggaaccct tttgtgcaag tttttgtaaa gttctccgtg ggactcgatg ctgttgaact 2880 

attgaaacag atagatgaac tcgatattct aagactgaaa ttagagaaca acattgacat 2940 

catcgatcag cttaacacac tatcttccct gacagtaaat atttcctctt gtgtattata 3000 

tgaccgtatt caggcagcaa aaaccataga tgaaatggag agagaggcta aaaggctcta 3060 

caaaagcaac gaactctttg gaagtgttat ttttaagctt ccttctaaca gaagctggca 3120 

cagaggctat gactctggaa atgtctttct tcctcctgtc ataaaatata ccatccggat 3180 

gagtctcaag accgcacaga ccacaagaag cctaagaacc aagatttggg ctccagggcc 3240 

acacaattct ccatcacaca accagatcta tggcagggct tttatttatt tacaggatag 3300 

tattgaaaga gcaatcattg aattgcaaac tggaaggaac tcccaggaaa tagcagtcca 3360 

ggttcaagca attccttatc cctgcttcat gaaagacaac ttcctaacca gtgtctctta 3420 

ttctcttcca attgtgctta tggttgcctg ggttgtattt atagctgcct ttgtaaaaaa 3480 

gcttgtctat gagaaagacc tccggcttca tgagtacatg aagatgatgg gtgtgaactc 3540 

ctgcagccat ttctttgcct ggcttataga gagtgttgga tttttactgg ttaccatcgt 3600 

gatcctcatc attatactca agtttggcaa tattcttcct aaaacaaatg ggttcatttt 3660 

gttcctgtat ttttcggact acagcttctc ggttattgcc atgagctatc ttatcagtgt 3720 

cttcttcaac aacaccaaca ttgcagctct gatcggaagc ctcatctaca tcattgcctt 3780 

ctttccattt attgttctgg ttacagtgga gaatgagttg agctatgtat tgaaagtgtt 3840 

catgagcctg ctgtccccaa cagcattcag ctatgcaagc caatacattg cacgatacga 3900 

agaacagggc attggtcttc agtgggaaaa tatgtacacc tccccggttc aggatgacac 3960 
cacctcattt ggctggctgt gctgtctaat cctagctgac tctttcattt atttccttat 4020

tgcttggtat gtcaggaatg tcttcccagg gacatacggt atggcagctc cctggtattt 4080 

tccaattctt ccttcctatt ggaaggagcg atttgggtgt gcagaggtga agcctgagaa 4140 

gagcaatggc ctcatgttta ctaacatcat gatgcagaac accaacccat ctgccagtcc 4200 

tgaatacatg ttttcctcta acatcgagcc tgaacctaaa gatctcacag tcggggttgc 4260 

cctgcatggg gtcacaaaga tctatggctc aaaagttgct gttgataacc tcaatctgaa 4320 

cttttatgaa gggcatatta cttcattgct ggggcccaat ggagctggga aaactactac 4380 

catttccatg ttaactgggc tgtttggggc ctcagcaggc accatttttg tatatggaaa 4440 

agatatcaaa acagacctac acacggtacg gaagaacatg ggagtctgta tgcagcacga 4500 

cgtcttgttc agttacctca ctactaagga gcaccttctc ctatatggtt ccatcaaagt 4560 

tcctcactgg actaaaaagc agctccacga ggaagtaaaa aggactttaa aagatactgg 4620 

actatatagc catcgtcata agagagttgg aacactgtca ggaggcatga agaggaagtt 4680

atctatatcc atagctctca ttggtggatc aagggtagta attttggatg aaccatctac 4740

tggagttgac ccatgttctc gccgaagtat atgggatgtt atatccaaga acaaaactgc 4800

cagaacaatc attctgtcaa cgcaccactt ggacgaggct gaagtgctga gtgaccgcat 4860

cgccttcctg gagcagggtg ggcttaggtg ctgtgggtcc ccattttacc tcaaggaagc 4920

ctttggcgat gggtatcacc tcacgcttac caagaagaag agtccaaatt taaatgcaaa 4980

tgcagtatgt gacaccatgg ccgtgacagc aatgatccaa tcacatctcc ccgaagccta 5040

cctcaaggag gatattgggg gagagcttgt ttatgtactt cctccattca gcaccaaagt 5100

ctcaggggcc tacctgtcac tcctacgggc actcgacaat ggcatgggtg acctcaacat 5160

cgggtgctac ggcatttcag ataccaccgt ggaggaggtc tttctgaact tgaccaaaga 5220

gtcacaaaaa aatagtgcta tgagtcttga gcacttaaca caaaagaaaa ttgggaattc 5280 

caatgccaat ggcatctcaa ctcctgacga tttatctgtg agcagcagca atttcacaga 5340 

cagagatgac aaaatcctga caagaggaga gaggctggat ggctttggac tgttgctgaa 5400 

gaagatcatg gctatactca tcaagaggtt ccaccacrcc cgcaggaact ggaaaggtct 5460 

cattgctcag gttatcctcc ccatcgtctt tgttaccact gccatgggcc ttggcacact 5520 

gagaaattcc agcaacagtt atccagagat tcagatctcc ccctctcttt atggtacctc 5580 

cgaacagaca gccttctatg ctaattatca cccgagcacg gaagcacttg tctcagcaat 5640 

gtgggacttc cctggaattg acaacatgtg tctgaacacc agtgatctac agtgtttaaa 5700 

caaagacagt ctggaaaaat ggaacaccag tggagaaccc atcactaatt ttggtgtttg 5760 

ctcctgctca gaaaatgtcc aggaatgtcc taaatttaac tattccccac cgcacagaag 5820 

aacttactca tcccaggtaa tttataacct cactgggcaa cgagtggaaa attatcttat 5880 
atcaactgca aatgagtttg tccaaaaaag atatggaggt tggagttttg ggctgccttt 5940

gacaaaagac cttcgttttg atataacagg agtccctgcc aatagaacac ttgccaaggt 6000 

atggtatgat ccagaaggct atcactccct tccagcttac ctcaacagcc tgaataattt 6060 

ccttctgcga gttaacatgt caaaatacga tgctgcccga catggcatca tcatgtatag 6120 

ccatccttat ccaggagtgc aagaccaaga acaagccaca atcagcagtt taatcgatat 6180 

tttagtggca ctgtctatct tgatgggcta ctctgtcacc accgccagct ttgtcaccta 6240 

tgttgtaagg gaacatcaaa ccaaagccaa acagttgcag cacatttcag gcattggcgt 6300 

gacatgctac tgggtaacaa acttcattta tgacatggtt ttctacttgg tgcctgtagc 6360 

gttttcaatt ggtatcattg cgattttcaa attacctgca ttctacagtg aaaacaacct 6420 

aggcgctgta tctctcctac ttctcctgtt tgggcatgca acattttcct ggatgtactt 6480 

gctggctggg ctcttccatg aaacaggaat ggccttcatc acttacgtct gtgtcaactt 6540 

gttttttggc attaattcca ttgtttccct gtcagtggta tactttcttt ccaaggaaaa 6600

gcctaatgat ccgactttag aacttatttc tgaaaccctc aagcgcattt tcctgatttt 6660

cccacaattc tgttttggct acggtttgat tgaactttct caacaacagt cggtcctaga 6720

cttcttaaaa gcatatggag tggaataccc aaatgaaacc tttgagatga ataaactagg 6780

tgcaatgttt gtggctttgg tttctcaggg caccatgttt ttttccttgc gactcttaat 6840

caacgaatcc ctgataaaga aactcaggct tttcttcaga aaatttaatt cttcacatgt 6900

aagggagaca atagatgagg atgaagatgt gcgggctgag agattaagag ttgagagtgg 6960

tgcagctgaa tttgacttgg tccaacttta ttgtctcaca aagacctacc aacttatcca 7020

caaaaagatt atagctgtaa acaacatcag catcgggata cctgctggag agtgttttgg 7080

gcttcttgga gtgaatggag caggaaagac cactatattc aagatgctga caggagacat 7140

cattccttca agtggaaaca ttctgatcag aaataagacc ggatctctgg gtcacgttga 7200 

ttctcacagc tcattagttg gctactgtcc tcaggaagat gccttagatg acctggtaac 7260 

tgtggaagaa catttgtatt tctatgccag ggtacatgga attccagaaa aggatattaa 7320 

agaaactgtt cataaactcc ttaggagact tcacctgatg cccttcaagg acagagctac 7380 

ctctatgtgc agttatggca caaaaagaaa attatccact gcactggcct tgatagggaa 7440 

accttccatt ctactgctgg atgagccgag ctctggcatg gatccgaagt cgaaacggca 7500 

cctctggaag atcatttcag aagaagtaca gaacaaatgt tccgtcatcc tcacatctca 7560 

cagcatggaa gaatgtgaag ctctctgtac caggttggcc attatggtga atggaaagtt 7620 

tcaatgtatt ggatctttgc agcacataaa gagcaggttt ggacgaggat ttactgtcaa 7680 

agttcacttg aagaataaca aagtgaccat ggagaccctc acaaagttca tgcagctgca 7740 

ctttccaaaa acatacttaa aagatcagca cctcagcatg ctagagtatc atgtaccagt 7800

cacagcagga ggagtcgcaa acatttttga tctgctggaa accaacaaga ctgctttaaa 7860

tattacaaat ttcttagtga gtcagaccac tctggaagag gttttcatca actttgccaa 7920

agaccagaag tcctatgaaa ctgctgatac cagcagccaa ggttccacta taagtgttga 7980

ctcacaagat gaccagatgg agtcttaaca cttccagcaa actcaatctc agcgtgtgac 8040

caatggcttc attttgaaga aaagccacag aagatacact tccgcaagat atcttcattt 8100

taaagtaaag taatatactg tatggaaagt tacaactgtg ttagactaac aagtaattat 8160

aaaaggaaat ttttccttct aaggtcagtg agtgttgttg ctactgaaat gaattcctgt 8220

atactcaaca ctgtgagcat gctaatgtat atgctggtga ttcttatgca aaggtgaagc 8280

cacctcaaga tgaatatctt aatttattac tttcaataaa aagacagttt aaaaggcaaa 8340

aaaaaaaaaa 8350

<210> 4
<211> 8113
<212> DNA
<213> Homo sapiens

<400> 4

gaagagttga ttgagaagtg cctcttggtt aaggattaac cacagggaaa aatccagcag 60

aaacagaaga actgtgggtt tcttacccca gccctcaagg aagctatgcc gtgaaagggg 120

tactgataca ctgacataca gcaagttgga cggggcatca gttcttcatt tgtggagtgg 180

agaaaagaag aggaaatctc tcatttgggg catttgaagg atggcttccc tgtttcatca 240

gcttcagatc ctggtctgga aaaattggct aggtgtaaaa aggcagccgc tttggacact 300

tgtcttgatc ttatggccag tcattatttt cataattttg gctattactc ggaccaaatt 360

tcctccaact gcaaaaccaa cttgttacct cgcacctcga aaccttccta gtactggatt 420

ctttccattc ctgcagaccc tactctgtga cacagactct aaatgcaaag acacacccta 480

tggcccacaa gatctgcttc gtaggaaagg aattgatgat gcactattta aagacagtga 540

gattctgaga aagtcatcca acctggataa ggacagcagt ttatcattcc agagcaccca 600

agttccagaa agaaggcatg catcactagc cacagtattt cccagtccaa gttctgattt 660

ggaaatcccc ggaacatata ctttcaatgg cagtcaagtg ctcgcacgaa ttcttggctt 720

ggaaaagctg ttaaagcaaa attcaacttc agaagatata cgaagagaac tatgtgacag 780

ctattcagga tacattgtgg atgatgcctt ctcttggacc tttctaggaa gaaatgtttt 840

taacaaattt tgcctttcta acatgaccct tttagagtct tctctccaag aactaaacaa 900

acagttctcc cagctatcca gtgaccccaa caatcagaag atagtgtttc aggaaatagt 960

cagaatgctg tctttcttct cacaagtgca agagcagaaa gctgtgtggc agcttctgtc 1020

tagttttcca aatgtgtttc agaatgacac atcactaagc aatctatttg atgttcttcg 1080 

aaaggcaaac agtgtgctgc tggttgtgca gaaggtttat ccacgttttg caactaacga 1140 

aggtttcaga accctccaga agtctgttaa acatctgctg tacactctgg actccccagc 1200 

tcaaggtgac tccgataata taacgcatgt gtggaatgag gatgatggac agaccttatc 1260 

tccaagcagt ctggctgcac agctcctaat tctggaaaac tttgaagatg ccctcttaaa 1320 

tatatcagca aatagtcctt atattcctta cttggcatgt gtgagaaatg tgactgacag 1380 

tttggccaga ggttcaccag aaaatctaag actcctgcag tccacaatac gatttaaaaa 1440 

atcttttctt cgcaatggtt cctatgaaga ttactttcct ccagttcctg aagtcctaaa 1500 

atcaaaactg tctcaacttc gaaacttgac cgaacttctt tgtgaatctg aaactttcag 1560 

tttgatagag aagtcatgcc agctctctga tatgagcttt gggagcctgt gtgaagaaag 1620 

tgagtttgat ctgcaactcc tcgaagcggc agagctgggc accgaaatag cagccagctt 1680

actgtaccat gacaatgtca tatctaaaaa agtgagagat ttgctgactg gagatccaag 1740

caaaattaat ttaaatatgg atcagtttct agaacaggca ctgcaaatga attacttgga 1800

aaatatcact cagttaatac cgatcataga agccatgctg catgtcaata acagtgcaga 1860

tgcttctgaa aagccaggtc agttactaga aatgtttaaa aatgttgaag agctgaaaga 1920

agatttaagg agaacaacag gaatgtccaa caggactatt gacaagttgc tggccattcc 1980

catccctgat aatagagctg agattatttc tcaggtgttc tggctgcatt cctgtgatac 2040

taatatcacc actcccaaac tagaagatgc aatgaaagaa ttctgcaacc tgtctctttc 2100

agagagatcc cggcagtctt acctcatcgg actcaccctt ctgcactact taaacattta 2160

caacttcaca gacaaggtgt ttttcccgag gaaagatcaa aagccagtag aaaagatgat 2220

ggagctcttc ataagactaa aagagattct caatcagatg gcttctggca cacatccgct 2280 

gctagacaaa atgagatccc tgaagcaaat gcatctgccc agaagtgttc cattaacaca 2340 

ggcaatgtac agaagcaacc gaatgaacac accacaagga tcatttagca ccatctccca 2400 

agcattatgt tctcaaggaa ttaccactga atatttaact gccatgctgc cctcttccca 2460 

gaggccaaaa ggcaaccaca ccaaggattt tttgacttat aaattaacta aagagcaaat 2520 

tgcttcaaaa tatggaattc ccataaatac cacaccattt tgcttctccc tttataaaga 2580 

catcattaac atgcccgctg gacctgtgat ttgggctttc ttgaaaccta tgttgttggg 2640 

aagaattttg catgcaccat ataacccagt cacaaaggca ataatggaaa agtccaatgt 2700 

aactctgaga cagctggcgg aattaagaga aaaatctcaa gagtggatgg ataagtcgcc 2760 

acttttcatg aattccttcc atctgttaaa ccaggcaatt ccaatgctcc agaatactct 2820 

aaggaaccct tttgtgcaag tttttgtaaa gttctccgtg ggactcgatg ctgttgaact 2880

attgaaacag atagatgaac tcgatattct aagactgaaa ttagagaaca acattgacat 2940

catcgatcag cttaacacac tatcttccct gacagtaaat atttcctctt gtgtattata 3000 

tgaccgtatt caggcagcaa aaaccataga tgaaatggag agagaggcta aaaggctcta 3060 

caaaagcaac gaactctttg gaagtgttat ttttaagctt ccttctaaca gaagctggca 3120 

cagaggctat gactctggaa atgtctttct tcctcctgtc ataaaatata ccatccggat 3180 

gagtctcaag accgcacaga ccacaagaag cctaagaacc aagatttggg ctccagggcc 3240 

acacaattct ccatcacaca accagatcta tggcagggct tttatttatt tacaggatag 3300 

tattgaaaga gcaatcattg aattgcaaac tggaaggaac tcccaggaaa tagcagtcca 3360 

ggttcaagca attccttatc cctgcttcat gaaagacaac ttcctaacca gtgtctctta 3420 

ttctcttcca attgtgctta tggttgcctg ggttgtattt atagctgcct ttgtaaaaaa 3480 

gcttgtctat gagaaagacc tccggcttca tgagtacatg aagatgatgg gtgtgaactc 3540 

ctgcagccat ttctttgcct ggcttataga gagtgttgga tttttactgg ttaccatcgt 3600 

gatcctcatc attatactca agtttggcaa tattcttcct aaaacaaatg ggttcatttt 3660

gttcctgtat ttttcggact acagcttctc ggttattgcc atgagctatc ttatcagtgt 3720

cttcttcaac aacaccaaca ttgcagctct gatcggaagc ctcatctaca tcattgcctt 3780

ctttccattt attgttctgg ttacagtgga gaatgagttg agctatgtat tgaaagtgtt 3840

catgagcctg ctgtccccaa cagcattcag ctatgcaagc caatacattg cacgatacga 3900

agaacagggc attggtcttc agtgggaaaa tatgtacacc tccccggttc aggatgacac 3960

cacctcattt ggctggctgt gctgtctaat cctagctgac tctttcattt atttccttat 4020

tgcttggtat gtcaggaatg tcttcccagg gacatacggt atggcagctc cctggtattt 4080

tccaattctt ccttcctatt ggaaggagcg atttgggtgt gcagaggtga agcctgagaa 4140

gagcaatggc ctcatgttta ctaacatcat gatgcagaac accaacccat ctgccagtcc 4200 

tgaatacatg ttttcctcta acatcgagcc tgaacctaaa gatctcacag tcggggttgc 4260 

cctgcatggg gtcacaaaga tctatggctc aaaagttgct gttgataacc tcaatctgaa 4320 

cttttatgaa gggcatatta cttcattgct ggggcccaat ggagctggga aaactactac 4380 

catttccatg ttaactgggc tgtttggggc ctcagcaggc accatttttg tatatggaaa 4440 

agatatcaaa acagacctac acacggtacg gaagaacatg ggagtctgta tgcagcacga 4500 

cgtcttgttc agttacctca ctactaagga gcaccttctc ctatatggtt ccatcaaagt 4560 

tcctcactgg actaaaaagc agctccacga ggaagtaaaa aggactttaa aagatactgg 4620 

actatatagc catcgtcata agagagttgg aacactgtca ggaggcatga agaggaagtt 4680 

atctatatcc atagctctca ttggtggatc aagggtagta attttggatg aaccatctac 4740 

tggagttgac ccatgttctc gccgaagtat atgggatgtt atatccaaga acaaaactgc 4800

cagaacaatc attctgtcaa cgcaccactt ggacgaggct gaagtgctga gtgaccgcat 4860

cgccttcctg gagcagggtg ggcttaggtg ctgtgggtcc ccattttacc tcaaggaagc 4920

ctttggcgat gggtatcacc tcacgcttac caagaagaag gtctttctga acttgaccaa 4980

agagtcacaa aaaaatagtg ctatgagtct tgagcactta acacaaaaga aaattgggaa 5040

ttccaatgcc aatggcatct caactcctga cgatttatct gtgagcagca gcaatttcac 5100

agacagagat gacaaaatcc tgacaagagg agagaggctg gatggctttg gactgttgct 5160

gaagaagatc atggctatac tcatcaagag gttccaccac gcccgcagga actggaaagg 5220

tctcattgct caggttatcc tccccatcgt ctttgttacc actgccatgg gccttggcac 5280

actgagaaat tccagcaaca gttatccaga gattcagatc tccccctctc tttatggtac 5340

ctccgracag acagccttct atgctaatta tcacccgagc acggaagcac ttgtctcagc 5400

aatgtgggac ttccctggaa ttgacaacat gtgtctgaac accagtgatc tacagtgttt 5460

aaacaaagac agtctggaaa aatggaacac cagtggagaa cccatcacta attttggtgt 5520

ttgctcctgc tcagaaaatg tccaggaatg tcctaaattt aactattccc caccgcacag 5580

aagaacttac tcatcccagg taatttataa cctcactggg caacgagtgg aaaattatct 5640

tatatcaact gcaaatgagt ttgtccaaaa aagatatgga ggttggagtt ttgggctgcc 5700

tttgacaaaa gaccttcgtt ttgatataac aggagtccct gccaatagaa cacttgccaa 5760

ggtatggtat gatccagaag gctatcactc ccttccagct tacctcaaca gcctgaataa 5820

tttccttctg cgagttaaca tgtcaaaata cgatgctgcc cgacatggca tcatcatgta 5880

tagccatcct tatccaggag tgcaagacca agaacaagcc acaatcagca gtttaatcga 5940

tattttagtg gcactgtcta tcttgatggg ctactctgtc accaccgcca gctttgtcac 6000

ctatgttgta agggaacatc aaaccaaagc caaacagttg cagcacattt caggcattgg 6060

cgtgacatgc tactgggtaa caaacttcat ttatgacatg gttttctact tggtgcctgt 6120

agcgttttca attggtatca ttgcgatttt caaattacct gcattctaca gtgaaaacaa 6180

cctaggcgct gtatctctcc tacttctcct gtttgggcat gcaacatttt cctggatgta 6240

cttgctggct gggctcttcc atgaaacagg aatggccttc atcacttacg tctgtgtcaa 6300

cttgtttttt ggcattaatt ccattgtttc cctgtcagtg gtatactttc tttccaagga 6360

aaagcctaat gatccgactt tagaacttat ttctgaaacc ctcaagcgca ttttcctgat 6420

tttcccacaa ttctgttttg gctacggttt gattgaactt tctcaacaac agtcggtcct 6480

agacttctta aaagcatatg gagtggaata cccaaatgaa acctttgaga tgaataaact 6540

aggtgcaatg tttgtggctt tggtttctca gggcaccatg tttttttcct tgcgactctt 6600

aatcaacgaa tccctgataa agaaactcag gcttttcttc agaaaattta attcttcaca 6660

tgtaagggag acaatagatg aggatgaaga tgtgcgggct gagagattaa gagttgagag 6720

tggtgcagct gaatttgact tggtccaact ttattgtctc acaaagacct accaacttat 6780

ccacaaaaag attatagctg taaacaacat cagcatcggg atacctgctg gagagtgttt 6840 

tgggcttctt ggagtgaatg gagcaggaaa gaccactata ttcaagatgc tgacaggaga 6900 

catcattcct tcaagtggaa acattctgat cagaaataag accggatctc tgggtcacgt 6960 

tgattctcac agctcattag ttggctactg tcctcaggaa gatgccttag atgacctggt 7020 

aactgtggaa gaacatttgt atttctatgc cagggtacat ggaattccag aaaaggatat 7080 

taaagaaact gttcataaac tccttaggag acttcacctg atgcccttca aggacagagc 7140 

tacctctatg tgcagttatg gcacaaaaag aaaattatcc actgcactgg ccttgatagg 7200 

gaaaccttcc attctactgc tggatgagcc gagctctggc atggatccga agtcgaaacg 7260 

gcacctctgg aagatcattt cagaagaagt acagaacaaa tgttccgtca tcctcacatc 7320 

tcacagcatg gaagaatgtg aagctctctg taccaggttg gccattatgg tgaatggaaa 7380 

gtttcaatgt attggatctt tgcagcacat aaagagcagg tttggacgag gatttactgt 7440

caaagttcac ttgaagaata acaaagtgac catggagacc ctcacaaagt tcatgcagct 7500

gcactttcca aaaacatact taaaagatca gcacctcagc atgctagagt atcatgtacc 7560

agtcacagca ggaggagtcg caaacatttt tgatctgctg gaaaccaaca agactgcttt 7620

aaatattaca aatttcttag tgagtcagac cactctggaa gaggttttca tcaactttgc 7680

caaagaccag aagtcctatg aaactgctga taccagcagc caaggttcca ctataagtgt 7740

tgactcacaa gatgaccaga tggagtctta acacttccag caaactcaat ctcagcgtgt 7800

gaccaatggc ttcattttga agaaaagcca cagaagatac acttccgcaa gatatcttca 7860

ttttaaagta aagtaatata ctgtatggaa agttacaact gtgttagact aacaagtaat 7920

tataaaagga aatttttcct tctaaggtca gtgagtgttg ttgctactga aatgaattcc 7980

tgtatactca acactgtgag catgctaatg tatatgctgg tgattcttat gcaaaggtga 8040 

agccacctca agatgaatat cttaatttat tactttcaat aaaaagacag tttaaaaggc 8100 

aaaaaaaaaa aaa 8113 

<210> 5 
<211> 2595 
<212> PRT 
<213> Homo sapiens 



<220> 

<221> xaa 

<222> (1)..(2595)

<223> Xaa = any amino acid



<400> 5 

Met Ala Ser Leu Phe His Gln Leu Gln Ile Leu Val Trp Lys Asn Trp
1 5 10 15

Leu Gly Val Lys Arg Gln Pro Leu Trp Thr Leu Val Leu Ile Leu Trp
20 25 30

Pro Val Ile Ile Phe Ile Ile Leu Ala Ile Thr Arg Thr Lys Phe Pro
35 40 45

Pro Thr Ala Lys Pro Thr Cys Tyr Leu Ala Pro Arg Asn Leu Pro Ser
50 55 60

Thr Gly Phe Phe Pro Phe Leu Gln Thr Leu Leu Cys Asp Thr Asp Ser
65 70 75 80

Lys Cys Lys Asp Thr Pro Tyr Gly Pro Gln Asp Leu Leu Arg Arg Lys
85 90 95

Gly Ile Asp Asp Ala Leu Phe Lys Asp Ser Glu Ile Leu Arg Lys Ser
100 105 110

Ser Asn Leu Asp Lys Asp Ser Ser Leu Ser Phe Gln Ser Thr Gln Val
115 120 125

Pro Glu Arg Arg His Ala Ser Leu Ala Thr Val Phe Pro Ser Pro Ser
130 135 140

Ser Asp Leu Glu Ile Pro Gly Thr Tyr Thr Phe Asn Gly Ser Gln Val
145 150 155 160

Leu Ala Arg Ile Leu Gly Leu Glu Lys Leu Leu Lys Gln Asn Ser Thr
165 170 175

Ser Glu Asp Ile Arg Arg Glu Leu Cys Asp Ser Tyr Ser Gly Tyr Ile
180 185 190

Val Asp Asp Ala Phe Ser Trp Thr Phe Leu Gly Arg Asn Val Phe Asn
195 200 205

Lys Phe Cys Leu Ser Asn Met Thr Leu Leu Glu Ser Ser Leu Gln Glu
210 215 220

Leu Asn Lys Gln Phe Ser Gln Leu Ser Ser Asp Pro Asn Asn Gln Lys
225 230 235 240

Ile Val Phe Gln Glu Ile Val Arg Met Leu Ser Phe Phe Ser Gln Val
245 250 255

Gln Glu Gln Lys Ala Val Trp Gln Leu Leu Ser Ser Phe Pro Asn Val
260 265 270

Phe Gln Asn Asp Thr Ser Leu Ser Asn Leu Phe Asp Val Leu Arg Lys
275 280 285

Ala Asn Ser Val Leu Leu Val Val Gln Lys Val Tyr Pro Arg Phe Ala
290 295 300

Thr Asn Glu Gly Phe Arg Thr Leu Gln Lys Ser Val Lys His Leu Leu
305 310 315 320

Tyr Thr Leu Asp Ser Pro Ala Gln Gly Asp Ser Asp Asn Ile Thr His
325 330 335

Val Trp Asn Glu Asp Asp Gly Gln Thr Leu Ser Pro Ser Ser Leu Ala
340 345 350

Ala Gln Leu Leu Ile Leu Glu Asn Phe Glu Asp Ala Leu Leu Asn Ile
355 360 365

Ser Ala Asn Ser Pro Tyr Ile Pro Tyr Leu Ala Cys Val Arg Asn Val
370 375 380

Thr Asp Ser Leu Ala Arg Gly Ser Pro Glu Asn Leu Arg Leu Leu Gln
385 390 395 400

Ser Thr Ile Arg Phe Lys Lys Ser Phe Leu Arg Asn Gly Ser Tyr Glu
405 410 415

Asp Tyr Phe Pro Pro Val Pro Glu Val Leu Lys Ser Lys Leu Ser Gln
420 425 430

Leu Arg Asn Leu Thr Glu Leu Leu Cys Glu Ser Glu Thr Phe Ser Leu
435 440 445

Ile Glu Lys Ser Cys Gln Leu Ser Asp Met Ser Phe Gly Ser Leu Cys
450 455 460

Glu Glu Ser Glu Phe Asp Leu Gln Leu Leu Glu Ala Ala Glu Leu Gly
465 470 475 480

Thr Glu Ile Ala Ala Ser Leu Leu Tyr His Asp Asn Val Ile Ser Lys
485 490 495

Lys Val Arg Asp Leu Leu Thr Gly Asp Pro Ser Lys Ile Asn Leu Asn
500 505 510

Met Asp Gln Phe Leu Glu Gln Ala Leu Gln Met Asn Tyr Leu Glu Asn
515 520 525

Ile Thr Gln Leu Ile Pro Ile Ile Glu Ala Met Leu His Val Asn Asn
530 535 540

Ser Ala Asp Ala Ser Glu Lys Pro Gly Gln Leu Leu Glu Met Phe Lys
545 550 555 560

Asn Val Glu Glu Leu Lys Glu Asp Leu Arg Arg Thr Thr Gly Met Ser
565 570 575

Asn Arg Thr Ile Asp Lys Leu Leu Ala Ile Pro Ile Pro Asp Asn Arg
580 585 590

Ala Glu Ile Ile Ser Gln Val Phe Trp Leu His Ser Cys Asp Thr Asn
595 600 605

Ile Thr Thr Pro Lys Leu Glu Asp Ala Met Lys Glu Phe Cys Asn Leu
610 615 620

Ser Leu Ser Glu Arg Ser Arg Gln Ser Tyr Leu Ile Gly Leu Thr Leu
625 630 635 640

Leu His Tyr Leu Asn Ile Tyr Asn Phe Thr Asp Lys Val Phe Phe Pro
645 650 655

Arg Lys Asp Gln Lys Pro Val Glu Lys Met Met Glu Leu Phe Ile Arg
660 665 670

Leu Lys Glu Ile Leu Asn Gln Met Ala Ser Gly Thr His Pro Leu Leu
675 680 685

Asp Lys Met Arg Ser Leu Lys Gln Met His Leu Pro Arg Ser Val Pro
690 695 700

Leu Thr Gln Ala Met Tyr Arg Ser Asn Arg Met Asn Thr Pro Gln Gly
705 710 715 720

Ser Phe Ser Thr Ile Ser Gln Ala Leu Cys Ser Gln Gly Ile Thr Thr
725 730 735

Glu Tyr Leu Thr Ala Met Leu Pro Ser Ser Gln Arg Pro Lys Gly Asn
740 745 750

His Thr Lys Asp Phe Leu Thr Tyr Lys Leu Thr Lys Glu Gln Ile Ala
755 760 765

Ser Lys Tyr Gly Ile Pro Ile Asn Thr Thr Pro Phe Cys Phe Ser Leu
770 775 780

Tyr Lys Asp Ile Ile Asn Met Pro Ala Gly Pro Val Ile Trp Ala Phe
785 790 795 800

Leu Lys Pro Met Leu Leu Gly Arg Ile Leu His Ala Pro Tyr Asn Pro
805 810 815

Val Thr Lys Ala Ile Met Glu Lys Ser Asn Val Thr Leu Arg Gln Leu
820 825 830

Ala Glu Leu Arg Glu Lys Ser Gln Glu Trp Met Asp Lys Ser Pro Leu
835 840 845

Phe Met Asn Ser Phe His Leu Leu Asn Gln Ala Ile Pro Met Leu Gln
850 855 860

Asn Thr Leu Arg Asn Pro Phe Val Gln Val Phe Val Lys Phe Ser Val
865 870 875 880

Gly Leu Asp Ala Val Glu Leu Leu Lys Gln Ile Asp Glu Leu Asp Ile
885 890 895

Leu Arg Leu Lys Leu Glu Asn Asn Ile Asp Ile Ile Asp Gln Leu Asn
900 905 910

Thr Leu Ser Ser Leu Thr Val Asn Ile Ser Ser Cys Val Leu Tyr Asp
915 920 925

Arg Ile Gln Ala Ala Lys Thr Ile Asp Glu Met Glu Arg Glu Ala Lys
930 935 940

Arg Leu Tyr Lys Ser Asn Glu Leu Phe Gly Ser Val Ile Phe Lys Leu
945 950 955 960

Pro Ser Asn Arg Ser Trp His Arg Gly Tyr Asp Ser Gly Asn Val Phe
965 970 975

Leu Pro Pro Val Ile Lys Tyr Thr Ile Arg Met Ser Leu Lys Thr Ala
980 985 990

Gln Thr Thr Arg Ser Leu Arg Thr Lys Ile Trp Ala Pro Gly Pro His
995 1000 1005

Asn Ser Pro Ser His Asn Gln Ile Tyr Gly Arg Ala Phe Ile Tyr
1010 1015 1020

Leu Gln Asp Ser Ile Glu Arg Ala Ile Ile Glu Leu Gln Thr Gly
1025 1030 1035

Arg Asn Ser Gln Glu Ile Ala Val Gln Val Gln Ala Ile Pro Tyr
1040 1045 1050

Pro Cys Phe Met Lys Asp Asn Phe Leu Thr Ser Val Ser Tyr Ser
1055 1060 1065

Leu Pro Ile Val Leu Met Val Ala Trp Val Val Phe Ile Ala Ala
1070 1075 1080

Phe Val Lys Lys Leu Val Tyr Glu Lys Asp Leu Arg Leu His Glu
1085 1090 1095

Tyr Met Lys Met Met Gly Val Asn Ser Cys Ser His Phe Phe Ala
1100 1105 1110

Trp Leu Ile Glu Ser Val Gly Phe Leu Leu Val Thr Ile Val Ile
1115 1120 1125

Leu Ile Ile Ile Leu Lys Phe Gly Asn Ile Leu Pro Lys Thr Asn
1130 1135 1140

Gly Phe Ile Leu Phe Leu Tyr Phe Ser Asp Tyr Ser Phe Ser Val
1145 1150 1155

Ile Ala Met Ser Tyr Leu Ile Ser Val Phe Phe Asn Asn Thr Asn
1160 1165 1170

Ile Ala Ala Leu Ile Gly Ser Leu Ile Tyr Ile Ile Ala Phe Phe
1175 1180 1185

Pro Phe Ile Val Leu Val Thr Val Glu Asn Glu Leu Ser Tyr Val
1190 1195 1200

Leu Lys Val Phe Met Ser Leu Leu Ser Pro Thr Ala Phe Ser Tyr
1205 1210 1215

Ala Ser Gln Tyr Ile Ala Arg Tyr Glu Glu Gln Gly Ile Gly Leu
1220 1225 1230

Gln Trp Glu Asn Met Tyr Thr Ser Pro Val Gln Asp Asp Thr Thr
1235 1240 1245

Ser Phe Gly Trp Leu Cys Cys Leu Ile Leu Ala Asp Ser Phe Ile
1250 1255 1260

Tyr Phe Leu Ile Ala Trp Tyr Val Arg Asn Val Phe Pro Gly Thr
1265 1270 1275

Tyr Gly Met Ala Ala Pro Trp Tyr Phe Pro Ile Leu Pro Ser Tyr
1280 1285 1290

Trp Lys Glu Arg Phe Gly Cys Ala Glu Val Lys Pro Glu Lys Ser
1295 1300 1305

Asn Gly Leu Met Phe Thr Asn Ile Met Met Gln Asn Thr Asn Pro
1310 1315 1320

Ser Ala Ser Pro Glu Tyr Met Phe Ser Ser Asn Ile Glu Pro Glu
1325 1330 1335

Pro Lys Asp Leu Thr Val Gly Val Ala Leu His Gly Val Thr Lys
1340 1345 1350

Ile Tyr Gly Ser Lys Val Ala Val Asp Asn Leu Asn Leu Asn Phe
1355 1360 1365

Tyr Glu Gly His Ile Thr Ser Leu Leu Gly Pro Asn Gly Ala Gly
1370 1375 1380

Lys Thr Thr Thr Ile Ser Met Leu Thr Gly Leu Phe Gly Ala Ser
1385 1390 1395

Ala Gly Thr Ile Phe Val Tyr Gly Lys Asp Ile Lys Thr Asp Leu
1400 1405 1410

His Thr Val Arg Lys Asn Met Gly Val Cys Met Gln His Asp Val
1415 1420 1425

Leu Phe Ser Tyr Leu Thr Thr Lys Glu His Leu Leu Leu Tyr Gly
1430 1435 1440

Ser Ile Lys Val Pro His Trp Thr Lys Lys Gln Leu His Glu Glu
1445 1450 1455

Val Lys Arg Thr Leu Lys Asp Thr Gly Leu Tyr Ser His Arg His
1460 1465 1470

Lys Arg Val Gly Thr Leu Ser Gly Gly Met Lys Arg Lys Leu Ser
1475 1480 1485

Ile Ser Ile Ala Leu Ile Gly Gly Ser Arg Val Val Ile Leu Asp
1490 1495 1500

Glu Pro Ser Thr Gly Val Asp Pro Cys Ser Arg Arg Ser Ile Trp
1505 1510 1515

Asp Val Ile Ser Lys Asn Lys Thr Ala Arg Thr Ile Ile Leu Ser
1520 1525 1530

Thr His His Leu Asp Glu Ala Glu Val Leu Ser Asp Arg Ile Ala
1535 1540 1545

Phe Leu Glu Gln Gly Gly Leu Arg Cys Cys Gly Ser Pro Phe Tyr
1550 1555 1560

Leu Lys Glu Ala Phe Gly Asp Gly Tyr His Leu Thr Leu Thr Lys
1565 1570 1575

Lys Lys Ser Pro Asn Leu Asn Ala Asn Ala Val Cys Asp Thr Met
1580 1585 1590

Ala Val Thr Ala Met Ile Gln Ser His Leu Pro Glu Ala Tyr Leu
1595 1600 1605

Lys Glu Asp Ile Gly Gly Glu Leu Val Tyr Val Leu Pro Pro Phe
1610 1615 1620

Ser Thr Lys Val Ser Gly Ala Tyr Leu Ser Leu Leu Arg Ala Leu
1625 1630 1635

Asp Asn Gly Met Gly Asp Leu Asn Ile Gly Cys Tyr Gly Ile Ser
1640 1645 1650

Asp Thr Thr Val Glu Glu Val Phe Leu Asn Leu Thr Lys Glu Ser
1655 1660 1665

Gln Lys Asn Ser Ala Met Ser Leu Glu His Leu Thr Gln Lys Lys
1670 1675 1680

Ile Gly Asn Ser Asn Ala Asn Gly Ile Ser Thr Pro Asp Asp Leu
1685 1690 1695

Ser Val Ser Ser Ser Asn Phe Thr Asp Arg Asp Asp Lys Ile Leu
1700 1705 1710

Thr Arg Gly Glu Arg Leu Asp Gly Phe Gly Leu Leu Leu Lys Lys
1715 1720 1725

Ile Met Ala Ile Leu Ile Lys Arg Phe His His Xaa Arg Arg Asn
1730 1735 1740

Trp Lys Gly Leu Ile Ala Gln Val Ile Leu Pro Ile Val Phe Val
1745 1750 1755

Thr Thr Ala Met Gly Leu Gly Thr Leu Arg Asn Ser Ser Asn Ser
1760 1765 1770

Tyr Pro Glu Ile Gln Ile Ser Pro Ser Leu Tyr Gly Thr Ser Glu
1775 1780 1785

Gln Thr Ala Phe Tyr Ala Asn Tyr His Pro Ser Thr Glu Ala Leu
1790 1795 1800

Val Ser Ala Met Trp Asp Phe Pro Gly Ile Asp Asn Met Cys Leu
1805 1810 1815

Asn Thr Ser Asp Leu Gln Cys Leu Asn Lys Asp Ser Leu Glu Lys
1820 1825 1830

Trp Asn Thr Ser Gly Glu Pro Ile Thr Asn Phe Gly Val Cys Ser
1835 1840 1845

Cys Ser Glu Asn Val Gln Glu Cys Pro Lys Phe Asn Tyr Ser Pro
1850 1855 1860

Pro His Arg Arg Thr Tyr Ser Ser Gln Val Ile Tyr Asn Leu Thr
1865 1870 1875

Gly Gln Arg Val Glu Asn Tyr Leu Ile Ser Thr Ala Asn Glu Phe
1880 1885 1890

Val Gln Lys Arg Tyr Gly Gly Trp Ser Phe Gly Leu Pro Leu Thr
1895 1900 1905

Lys Asp Leu Arg Phe Asp Ile Thr Gly Val Pro Ala Asn Arg Thr
1910 1915 1920

Leu Ala Lys Val Trp Tyr Asp Pro Glu Gly Tyr His Ser Leu Pro
1925 1930 1935

Ala Tyr Leu Asn Ser Leu Asn Asn Phe Leu Leu Arg Val Asn Met
1940 1945 1950

Ser Lys Tyr Asp Ala Ala Arg His Gly Ile Ile Met Tyr Ser His
1955 1960 1965

Pro Tyr Pro Gly Val Gln Asp Gln Glu Gln Ala Thr Ile Ser Ser
1970 1975 1980

Leu Ile Asp Ile Leu Val Ala Leu Ser Ile Leu Met Gly Tyr Ser
1985 1990 1995

Val Thr Thr Ala Ser Phe Val Thr Tyr Val Val Arg Glu His Gln
2000 2005 2010

Thr Lys Ala Lys Gln Leu Gln His Ile Ser Gly Ile Gly Val Thr
2015 2020 2025

Cys Tyr Trp Val Thr Asn Phe Ile Tyr Asp Met Val Phe Tyr Leu
2030 2035 2040

Val Pro Val Ala Phe Ser Ile Gly Ile Ile Ala Ile Phe Lys Leu
2045 2050 2055

Pro Ala Phe Tyr Ser Glu Asn Asn Leu Gly Ala Val Ser Leu Leu
2060 2065 2070

Leu Leu Leu Phe Gly His Ala Thr Phe Ser Trp Met Tyr Leu Leu
2075 2080 2085

Ala Gly Leu Phe His Glu Thr Gly Met Ala Phe Ile Thr Tyr Val
2090 2095 2100

Cys Val Asn Leu Phe Phe Gly Ile Asn Ser Ile Val Ser Leu Ser
2105 2110 2115

Val Val Tyr Phe Leu Ser Lys Glu Lys Pro Asn Asp Pro Thr Leu
2120 2125 2130

Glu Leu Ile Ser Glu Thr Leu Lys Arg Ile Phe Leu Ile Phe Pro
2135 2140 2145

Gln Phe Cys Phe Gly Tyr Gly Leu Ile Glu Leu Ser Gln Gln Gln
2150 2155 2160

Ser Val Leu Asp Phe Leu Lys Ala Tyr Gly Val Glu Tyr Pro Asn
2165 2170 2175

Glu Thr Phe Glu Met Asn Lys Leu Gly Ala Met Phe Val Ala Leu
2180 2185 2190

Val Ser Gln Gly Thr Met Phe Phe Ser Leu Arg Leu Leu Ile Asn
2195 2200 2205

Glu Ser Leu Ile Lys Lys Leu Arg Leu Phe Phe Arg Lys Phe Asn
2210 2215 2220

Ser Ser His Val Arg Glu Thr Ile Asp Glu Asp Glu Asp Val Arg
2225 2230 2235

Ala Glu Arg Leu Arg Val Glu Ser Gly Ala Ala Glu Phe Asp Leu
2240 2245 2250

Val Gln Leu Tyr Cys Leu Thr Lys Thr Tyr Gln Leu Ile His Lys
2255 2260 2265

Lys Ile Ile Ala Val Asn Asn Ile Ser Ile Gly Ile Pro Ala Gly
2270 2275 2280

Glu Cys Phe Gly Leu Leu Gly Val Asn Gly Ala Gly Lys Thr Thr
2285 2290 2295

Ile Phe Lys Met Leu Thr Gly Asp Ile Ile Pro Ser Ser Gly Asn
2300 2305 2310

Ile Leu Ile Arg Asn Lys Thr Gly Ser Leu Gly His Val Asp Ser
2315 2320 2325

His Ser Ser Leu Val Gly Tyr Cys Pro Gln Glu Asp Ala Leu Asp
2330 2335 2340

Asp Leu Val Thr Val Glu Glu His Leu Tyr Phe Tyr Ala Arg Val
2345 2350 2355

His Gly Ile Pro Glu Lys Asp Ile Lys Glu Thr Val His Lys Leu
2360 2365 2370

Leu Arg Arg Leu His Leu Met Pro Phe Lys Asp Arg Ala Thr Ser
2375 2380 2385

Met Cys Ser Tyr Gly Thr Lys Arg Lys Leu Ser Thr Ala Leu Ala
2390 2395 2400

Leu Ile Gly Lys Pro Ser Ile Leu Leu Leu Asp Glu Pro Ser Ser
2405 2410 2415

Gly Met Asp Pro Lys Ser Lys Arg His Leu Trp Lys Ile Ile Ser
2420 2425 2430

Glu Glu Val Gln Asn Lys Cys Ser Val Ile Leu Thr Ser His Ser
2435 2440 2445

Met Glu Glu Cys Glu Ala Leu Cys Thr Arg Leu Ala Ile Met Val
2450 2455 2460

Asn Gly Lys Phe Gln Cys Ile Gly Ser Leu Gln His Ile Lys Ser
2465 2470 2475

Arg Phe Gly Arg Gly Phe Thr Val Lys Val His Leu Lys Asn Asn
2480 2485 2490

Lys Val Thr Met Glu Thr Leu Thr Lys Phe Met Gln Leu His Phe
2495 2500 2505

Pro Lys Thr Tyr Leu Lys Asp Gln His Leu Ser Met Leu Glu Tyr
2510 2515 2520

His Val Pro Val Thr Ala Gly Gly Val Ala Asn Ile Phe Asp Leu
2525 2530 2535

Leu Glu Thr Asn Lys Thr Ala Leu Asn Ile Thr Asn Phe Leu Val
2540 2545 2550

Ser Gln Thr Thr Leu Glu Glu Val Phe Ile Asn Phe Ala Lys Asp
2555 2560 2565

Gln Lys Ser Tyr Glu Thr Ala Asp Thr Ser Ser Gln Gly Ser Thr
2570 2575 2580

Ile Ser Val Asp Ser Gln Asp Asp Gln Met Glu Ser
2585 2590 2595

<210> 6 

<211> 2516 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> xaa 

<222> (1)..(2516) 

<223> Xaa = any amino acid 



<400> 6

Met Ala Ser Leu Phe His Gln Leu Gln Ile Leu Val Trp Lys Asn Trp
1 5 10 15

Leu Gly Val Lys Arg Gln Pro Leu Trp Thr Leu Val Leu Ile Leu Trp
20 25 30

Pro Val Ile Ile Phe Ile Ile Leu Ala Ile Thr Arg Thr Lys Phe Pro
35 40 45

Pro Thr Ala Lys Pro Thr Cys Tyr Leu Ala Pro Arg Asn Leu Pro Ser
50 55 60

Thr Gly Phe Phe Pro Phe Leu Gln Thr Leu Leu Cys Asp Thr Asp Ser
65 70 75 80

Lys Cys Lys Asp Thr Pro Tyr Gly Pro Gln Asp Leu Leu Arg Arg Lys
85 90 95

Gly Ile Asp Asp Ala Leu Phe Lys Asp Ser Glu Ile Leu Arg Lys Ser
100 105 110

Ser Asn Leu Asp Lys Asp Ser Ser Leu Ser Phe Gln Ser Thr Gln Val
115 120 125

Pro Glu Arg Arg His Ala Ser Leu Ala Thr Val Phe Pro Ser Pro Ser
130 135 140

Ser Asp Leu Glu Ile Pro Gly Thr Tyr Thr Phe Asn Gly Ser Gln Val
145 150 155 160

Leu Ala Arg Ile Leu Gly Leu Glu Lys Leu Leu Lys Gln Asn Ser Thr
165 170 175

Ser Glu Asp Ile Arg Arg Glu Leu Cys Asp Ser Tyr Ser Gly Tyr Ile
180 185 190

Val Asp Asp Ala Phe Ser Trp Thr Phe Leu Gly Arg Asn Val Phe Asn
195 200 205

Lys Phe Cys Leu Ser Asn Met Thr Leu Leu Glu Ser Ser Leu Gln Glu
210 215 220

Leu Asn Lys Gln Phe Ser Gln Leu Ser Ser Asp Pro Asn Asn Gln Lys
225 230 235 240

Ile Val Phe Gln Glu Ile Val Arg Met Leu Ser Phe Phe Ser Gln Val
245 250 255

Gln Glu Gln Lys Ala Val Trp Gln Leu Leu Ser Ser Phe Pro Asn Val
260 265 270

Phe Gln Asn Asp Thr Ser Leu Ser Asn Leu Phe Asp Val Leu Arg Lys
275 280 285

Ala Asn Ser Val Leu Leu Val Val Gln Lys Val Tyr Pro Arg Phe Ala
290 295 300

Thr Asn Glu Gly Phe Arg Thr Leu Gln Lys Ser Val Lys His Leu Leu
305 310 315 320

Tyr Thr Leu Asp Ser Pro Ala Gln Gly Asp Ser Asp Asn Ile Thr His
325 330 335

Val Trp Asn Glu Asp Asp Gly Gln Thr Leu Ser Pro Ser Ser Leu Ala
340 345 350

Ala Gln Leu Leu Ile Leu Glu Asn Phe Glu Asp Ala Leu Leu Asn Ile
355 360 365

Ser Ala Asn Ser Pro Tyr Ile Pro Tyr Leu Ala Cys Val Arg Asn Val
370 375 380

Thr Asp Ser Leu Ala Arg Gly Ser Pro Glu Asn Leu Arg Leu Leu Gln
385 390 395 400

Ser Thr Ile Arg Phe Lys Lys Ser Phe Leu Arg Asn Gly Ser Tyr Glu
405 410 415

Asp Tyr Phe Pro Pro Val Pro Glu Val Leu Lys Ser Lys Leu Ser Gln
420 425 430

Leu Arg Asn Leu Thr Glu Leu Leu Cys Glu Ser Glu Thr Phe Ser Leu
435 440 445

Ile Glu Lys Ser Cys Gln Leu Ser Asp Met Ser Phe Gly Ser Leu Cys
450 455 460

Glu Glu Ser Glu Phe Asp Leu Gln Leu Leu Glu Ala Ala Glu Leu Gly
465 470 475 480

Thr Glu Ile Ala Ala Ser Leu Leu Tyr His Asp Asn Val Ile Ser Lys
485 490 495

Lys Val Arg Asp Leu Leu Thr Gly Asp Pro Ser Lys Ile Asn Leu Asn
500 505 510

Met Asp Gln Phe Leu Glu Gln Ala Leu Gln Met Asn Tyr Leu Glu Asn
515 520 525

Ile Thr Gln Leu Ile Pro Ile Ile Glu Ala Met Leu His Val Asn Asn
530 535 540

Ser Ala Asp Ala Ser Glu Lys Pro Gly Gln Leu Leu Glu Met Phe Lys
545 550 555 560

Asn Val Glu Glu Leu Lys Glu Asp Leu Arg Arg Thr Thr Gly Met Ser
565 570 575

Asn Arg Thr Ile Asp Lys Leu Leu Ala Ile Pro Ile Pro Asp Asn Arg
580 585 590

Ala Glu Ile Ile Ser Gln Val Phe Trp Leu His Ser Cys Asp Thr Asn
595 600 605

Ile Thr Thr Pro Lys Leu Glu Asp Ala Met Lys Glu Phe Cys Asn Leu
610 615 620

Ser Leu Ser Glu Arg Ser Arg Gln Ser Tyr Leu Ile Gly Leu Thr Leu
625 630 635 640

Leu His Tyr Leu Asn Ile Tyr Asn Phe Thr Asp Lys Val Phe Phe Pro
645 650 655

Arg Lys Asp Gln Lys Pro Val Glu Lys Met Met Glu Leu Phe Ile Arg
660 665 670

Leu Lys Glu Ile Leu Asn Gln Met Ala Ser Gly Thr His Pro Leu Leu
675 680 685

Asp Lys Met Arg Ser Leu Lys Gln Met His Leu Pro Arg Ser Val Pro
690 695 700

Leu Thr Gln Ala Met Tyr Arg Ser Asn Arg Met Asn Thr Pro Gln Gly
705 710 715 720

Ser Phe Ser Thr Ile Ser Gln Ala Leu Cys Ser Gln Gly Ile Thr Thr
725 730 735

Glu Tyr Leu Thr Ala Met Leu Pro Ser Ser Gln Arg Pro Lys Gly Asn
740 745 750

His Thr Lys Asp Phe Leu Thr Tyr Lys Leu Thr Lys Glu Gln Ile Ala
755 760 765

Ser Lys Tyr Gly Ile Pro Ile Asn Thr Thr Pro Phe Cys Phe Ser Leu
770 775 780

Tyr Lys Asp Ile Ile Asn Met Pro Ala Gly Pro Val Ile Trp Ala Phe
785 790 795 800

Leu Lys Pro Met Leu Leu Gly Arg Ile Leu His Ala Pro Tyr Asn Pro
805 810 815

Val Thr Lys Ala Ile Met Glu Lys Ser Asn Val Thr Leu Arg Gln Leu
820 825 830

Ala Glu Leu Arg Glu Lys Ser Gln Glu Trp Met Asp Lys Ser Pro Leu
835 840 845

Phe Met Asn Ser Phe His Leu Leu Asn Gln Ala Ile Pro Met Leu Gln
850 855 860

Asn Thr Leu Arg Asn Pro Phe Val Gln Val Phe Val Lys Phe Ser Val
865 870 875 880

Gly Leu Asp Ala Val Glu Leu Leu Lys Gln Ile Asp Glu Leu Asp Ile
885 890 895

Leu Arg Leu Lys Leu Glu Asn Asn Ile Asp Ile Ile Asp Gln Leu Asn
900 905 910

Thr Leu Ser Ser Leu Thr Val Asn Ile Ser Ser Cys Val Leu Tyr Asp
915 920 925

Arg Ile Gln Ala Ala Lys Thr Ile Asp Glu Met Glu Arg Glu Ala Lys
930 935 940

Arg Leu Tyr Lys Ser Asn Glu Leu Phe Gly Ser Val Ile Phe Lys Leu
945 950 955 960

Pro Ser Asn Arg Ser Trp His Arg Gly Tyr Asp Ser Gly Asn Val Phe
965 970 975

Leu Pro Pro Val Ile Lys Tyr Thr Ile Arg Met Ser Leu Lys Thr Ala
980 985 990

Gln Thr Thr Arg Ser Leu Arg Thr Lys Ile Trp Ala Pro Gly Pro His
995 1000 1005

Asn Ser Pro Ser His Asn Gln Ile Tyr Gly Arg Ala Phe Ile Tyr
1010 1015 1020

Leu Gln Asp Ser Ile Glu Arg Ala Ile Ile Glu Leu Gln Thr Gly
1025 1030 1035

Arg Asn Ser Gln Glu Ile Ala Val Gln Val Gln Ala Ile Pro Tyr
1040 1045 1050

Pro Cys Phe Met Lys Asp Asn Phe Leu Thr Ser Val Ser Tyr Ser
1055 1060 1065

Leu Pro Ile Val Leu Met Val Ala Trp Val Val Phe Ile Ala Ala
1070 1075 1080

Phe Val Lys Lys Leu Val Tyr Glu Lys Asp Leu Arg Leu His Glu
1085 1090 1095

Tyr Met Lys Met Met Gly Val Asn Ser Cys Ser His Phe Phe Ala
1100 1105 1110

Trp Leu Ile Glu Ser Val Gly Phe Leu Leu Val Thr Ile Val Ile
1115 1120 1125

Leu Ile Ile Ile Leu Lys Phe Gly Asn Ile Leu Pro Lys Thr Asn
1130 1135 1140

Gly Phe Ile Leu Phe Leu Tyr Phe Ser Asp Tyr Ser Phe Ser Val
1145 1150 1155

Ile Ala Met Ser Tyr Leu Ile Ser Val Phe Phe Asn Asn Thr Asn
1160 1165 1170

Ile Ala Ala Leu Ile Gly Ser Leu Ile Tyr Ile Ile Ala Phe Phe
1175 1180 1185

Pro Phe Ile Val Leu Val Thr Val Glu Asn Glu Leu Ser Tyr Val
1190 1195 1200

Leu Lys Val Phe Met Ser Leu Leu Ser Pro Thr Ala Phe Ser Tyr
1205 1210 1215

Ala Ser Gln Tyr Ile Ala Arg Tyr Glu Glu Gln Gly Ile Gly Leu
1220 1225 1230

Gln Trp Glu Asn Met Tyr Thr Ser Pro Val Gln Asp Asp Thr Thr
1235 1240 1245

Ser Phe Gly Trp Leu Cys Cys Leu Ile Leu Ala Asp Ser Phe Ile
1250 1255 1260

Tyr Phe Leu Ile Ala Trp Tyr Val Arg Asn Val Phe Pro Gly Thr
1265 1270 1275

Tyr Gly Met Ala Ala Pro Trp Tyr Phe Pro Ile Leu Pro Ser Tyr
1280 1285 1290

Trp Lys Glu Arg Phe Gly Cys Ala Glu Val Lys Pro Glu Lys Ser
1295 1300 1305

Asn Gly Leu Met Phe Thr Asn Ile Met Met Gln Asn Thr Asn Pro
1310 1315 1320

Ser Ala Ser Pro Glu Tyr Met Phe Ser Ser Asn Ile Glu Pro Glu
1325 1330 1335

Pro Lys Asp Leu Thr Val Gly Val Ala Leu His Gly Val Thr Lys
1340 1345 1350

Ile Tyr Gly Ser Lys Val Ala Val Asp Asn Leu Asn Leu Asn Phe
1355 1360 1365

Tyr Glu Gly His Ile Thr Ser Leu Leu Gly Pro Asn Gly Ala Gly
1370 1375 1380

Lys Thr Thr Thr Ile Ser Met Leu Thr Gly Leu Phe Gly Ala Ser
1385 1390 1395

Ala Gly Thr Ile Phe Val Tyr Gly Lys Asp Ile Lys Thr Asp Leu
1400 1405 1410

His Thr Val Arg Lys Asn Met Gly Val Cys Met Gln His Asp Val
1415 1420 1425

Leu Phe Ser Tyr Leu Thr Thr Lys Glu His Leu Leu Leu Tyr Gly
1430 1435 1440

Ser Ile Lys Val Pro His Trp Thr Lys Lys Gln Leu His Glu Glu
1445 1450 1455

Val Lys Arg Thr Leu Lys Asp Thr Gly Leu Tyr Ser His Arg His
1460 1465 1470

Lys Arg Val Gly Thr Leu Ser Gly Gly Met Lys Arg Lys Leu Ser
1475 1480 1485

Ile Ser Ile Ala Leu Ile Gly Gly Ser Arg Val Val Ile Leu Asp
1490 1495 1500

Glu Pro Ser Thr Gly Val Asp Pro Cys Ser Arg Arg Ser Ile Trp
1505 1510 1515

Asp Val Ile Ser Lys Asn Lys Thr Ala Arg Thr Ile Ile Leu Ser
1520 1525 1530

Thr His His Leu Asp Glu Ala Glu Val Leu Ser Asp Arg Ile Ala
1535 1540 1545

Phe Leu Glu Gln Gly Gly Leu Arg Cys Cys Gly Ser Pro Phe Tyr
1550 1555 1560

Leu Lys Glu Ala Phe Gly Asp Gly Tyr His Leu Thr Leu Thr Lys 
1565 1570 1575 

Lys Lys Val Phe Leu Asn Leu Thr Lys Glu Ser Gln Lys Asn Ser
1580 1585 1590

Ala Met Ser Leu Glu His Leu Thr Gln Lys Lys Ile Gly Asn Ser
1595 1600 1605

Asn Ala Asn Gly Ile Ser Thr Pro Asp Asp Leu Ser Val Ser Ser
1610 1615 1620

Ser Asn Phe Thr Asp Arg Asp Asp Lys Ile Leu Thr Arg Gly Glu
1625 1630 1635

Arg Leu Asp Gly Phe Gly Leu Leu Leu Lys Lys Ile Met Ala Ile
1640 1645 1650

Leu Ile Lys Arg Phe His His Ala Arg Arg Asn Trp Lys Gly Leu
1655 1660 1665

Ile Ala Gln Val Ile Leu Pro Ile Val Phe Val Thr Thr Ala Met
1670 1675 1680

Gly Leu Gly Thr Leu Arg Asn Ser Ser Asn Ser Tyr Pro Glu Ile
1685 1690 1695

Gln Ile Ser Pro Ser Leu Tyr Gly Thr Ser Xaa Gln Thr Ala Phe
1700 1705 1710

Tyr Ala Asn Tyr His Pro Ser Thr Glu Ala Leu Val Ser Ala Met
1715 1720 1725

Trp Asp Phe Pro Gly Ile Asp Asn Met Cys Leu Asn Thr Ser Asp
1730 1735 1740

Leu Gln Cys Leu Asn Lys Asp Ser Leu Glu Lys Trp Asn Thr Ser
1745 1750 1755

Gly Glu Pro Ile Thr Asn Phe Gly Val Cys Ser Cys Ser Glu Asn
1760 1765 1770

Val Gln Glu Cys Pro Lys Phe Asn Tyr Ser Pro Pro His Arg Arg
1775 1780 1785

Thr Tyr Ser Ser Gln Val Ile Tyr Asn Leu Thr Gly Gln Arg Val
1790 1795 1800

Glu Asn Tyr Leu Ile Ser Thr Ala Asn Glu Phe Val Gln Lys Arg
1805 1810 1815

Tyr Gly Gly Trp Ser Phe Gly Leu Pro Leu Thr Lys Asp Leu Arg
1820 1825 1830

Phe Asp Ile Thr Gly Val Pro Ala Asn Arg Thr Leu Ala Lys Val
1835 1840 1845

Trp Tyr Asp Pro Glu Gly Tyr His Ser Leu Pro Ala Tyr Leu Asn
1850 1855 1860

Ser Leu Asn Asn Phe Leu Leu Arg Val Asn Met Ser Lys Tyr Asp
1865 1870 1875

Ala Ala Arg His Gly Ile Ile Met Tyr Ser His Pro Tyr Pro Gly
1880 1885 1890

Val Gln Asp Gln Glu Gln Ala Thr Ile Ser Ser Leu Ile Asp Ile
1895 1900 1905

Leu Val Ala Leu Ser Ile Leu Met Gly Tyr Ser Val Thr Thr Ala
1910 1915 1920

Ser Phe Val Thr Tyr Val Val Arg Glu His Gln Thr Lys Ala Lys
1925 1930 1935

Gln Leu Gln His Ile Ser Gly Ile Gly Val Thr Cys Tyr Trp Val
1940 1945 1950

Thr Asn Phe Ile Tyr Asp Met Val Phe Tyr Leu Val Pro Val Ala
1955 1960 1965

Phe Ser Ile Gly Ile Ile Ala Ile Phe Lys Leu Pro Ala Phe Tyr
1970 1975 1980

Ser Glu Asn Asn Leu Gly Ala Val Ser Leu Leu Leu Leu Leu Phe
1985 1990 1995

Gly His Ala Thr Phe Ser Trp Met Tyr Leu Leu Ala Gly Leu Phe
2000 2005 2010

His Glu Thr Gly Met Ala Phe Ile Thr Tyr Val Cys Val Asn Leu
2015 2020 2025

Phe Phe Gly Ile Asn Ser Ile Val Ser Leu Ser Val Val Tyr Phe
2030 2035 2040

Leu Ser Lys Glu Lys Pro Asn Asp Pro Thr Leu Glu Leu Ile Ser
2045 2050 2055



Glu Thr Leu Lys Arg Ile Phe Leu Ile Phe Pro Gln Phe Cys Phe
2060 2065 2070

Gly Tyr Gly Leu Ile Glu Leu Ser Gln Gln Gln Ser Val Leu Asp
2075 2080 2085

Phe Leu Lys Ala Tyr Gly Val Glu Tyr Pro Asn Glu Thr Phe Glu
2090 2095 2100

Met Asn Lys Leu Gly Ala Met Phe Val Ala Leu Val Ser Gln Gly
2105 2110 2115

Thr Met Phe Phe Ser Leu Arg Leu Leu Ile Asn Glu Ser Leu Ile
2120 2125 2130

Lys Lys Leu Arg Leu Phe Phe Arg Lys Phe Asn Ser Ser His Val
2135 2140 2145

Arg Glu Thr Ile Asp Glu Asp Glu Asp Val Arg Ala Glu Arg Leu
2150 2155 2160

Arg Val Glu Ser Gly Ala Ala Glu Phe Asp Leu Val Gln Leu Tyr
2165 2170 2175

Cys Leu Thr Lys Thr Tyr Gln Leu Ile His Lys Lys Ile Ile Ala
2180 2185 2190

Val Asn Asn Ile Ser Ile Gly Ile Pro Ala Gly Glu Cys Phe Gly
2195 2200 2205

Leu Leu Gly Val Asn Gly Ala Gly Lys Thr Thr Ile Phe Lys Met
2210 2215 2220

Leu Thr Gly Asp Ile Ile Pro Ser Ser Gly Asn Ile Leu Ile Arg
2225 2230 2235

Asn Lys Thr Gly Ser Leu Gly His Val Asp Ser His Ser Ser Leu
2240 2245 2250

Val Gly Tyr Cys Pro Gln Glu Asp Ala Leu Asp Asp Leu Val Thr
2255 2260 2265

Val Glu Glu His Leu Tyr Phe Tyr Ala Arg Val His Gly Ile Pro
2270 2275 2280

Glu Lys Asp Ile Lys Glu Thr Val His Lys Leu Leu Arg Arg Leu
2285 2290 2295

His Leu Met Pro Phe Lys Asp Arg Ala Thr Ser Met Cys Ser Tyr
2300 2305 2310

Gly Thr Lys Arg Lys Leu Ser Thr Ala Leu Ala Leu Ile Gly Lys
2315 2320 2325



Pro Ser Ile Leu Leu Leu Asp Glu Pro Ser Ser Gly Met Asp Pro
2330 2335 2340

Lys Ser Lys Arg His Leu Trp Lys Ile Ile Ser Glu Glu Val Gln
2345 2350 2355

Asn Lys Cys Ser Val Ile Leu Thr Ser His Ser Met Glu Glu Cys
2360 2365 2370

Glu Ala Leu Cys Thr Arg Leu Ala Ile Met Val Asn Gly Lys Phe
2375 2380 2385

Gln Cys Ile Gly Ser Leu Gln His Ile Lys Ser Arg Phe Gly Arg
2390 2395 2400

Gly Phe Thr Val Lys Val His Leu Lys Asn Asn Lys Val Thr Met
2405 2410 2415

Glu Thr Leu Thr Lys Phe Met Gln Leu His Phe Pro Lys Thr Tyr
2420 2425 2430

Leu Lys Asp Gln His Leu Ser Met Leu Glu Tyr His Val Pro Val
2435 2440 2445

Thr Ala Gly Gly Val Ala Asn Ile Phe Asp Leu Leu Glu Thr Asn
2450 2455 2460

Lys Thr Ala Leu Asn Ile Thr Asn Phe Leu Val Ser Gln Thr Thr
2465 2470 2475

Leu Glu Glu Val Phe Ile Asn Phe Ala Lys Asp Gln Lys Ser Tyr
2480 2485 2490

Glu Thr Ala Asp Thr Ser Ser Gln Gly Ser Thr Ile Ser Val Asp
2495 2500 2505

Ser Gln Asp Asp Gln Met Glu Ser
2510 2515



<210> 7 

<211> 21 

<212> DNA 

<213> Artificial 



<220> 

<223> primer 
<400> 7 

gaagagttga ttgagaagtg c

<210> 8

<211> 21

<212> DNA

<213> Artificial

<220>

<223> primer

<400> 8

cgaagagaac tatgtgacag c

<210> 9

<211> 20 

<212> DNA 

<213> Artificial 



<220> 

<223> primer 

<400> 9 

cttctcacaa gtgcaagagc 

<210> 10 

<211> 24 

<212> DNA 



<213> Artificial 



<220> 

<223> primer 

<400> 10 

cgcaatggtt cctatgaaga ttac 

<210> 11 

<211> 28 

<212> DNA 

<213> Artificial 

<220> 

<223> primer 

<400> 11 

cagaagggtg agtccgatga ggtaagac 

<210> 12 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> primer 

<400> 12 

gctgtcacat agttctcttc g 

<210> 13 

<211> 24 

<212> DNA 

<213> Artificial 

<220> 

<223> primer 

<400> 13 

gtaatcttca taggaaccat tgcg 



<210> 14 

<211> 25 

<212> DNA 

<213> Artificial 



<220> 

<223> primer 

<400> 14 

cctacacacg gtacggaaga acatg

<210> 15

<211> 27

<212> DNA

<213> Artificial

<220>

<223> primer

<400> 15

gccatcgtca taagagagtt ggaacac

<210> 16

<211> 19

<212> DNA 

<213> Artificial 



<220> 

<223> primer 

<400> 16 

gtgcttatgg ttgcctggg 

<210> 17 

<211> 20 

<212> DNA 

<213> Artificial 



<220> 

<223> primer 
<400> 17 

cttccatctg ttaaaccagg 

<210> 18 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> primer 
<400> 18 

ggtgttctgg ctgcattc 

<210> 19 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> primer 
<400> 19 

gcctcatcta catcattgcc 

<210> 20 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> primer 
<400> 20 

gtgttccaac tctcttatga cgatggc 



<210> 21 

<211> 25 

<212> DNA 

<213> Artificial 



<220> 

<223> primer 
<400> 21 

catgttcttc cgtaccgtgt gtagg

<210> 22

<211> 20

<212> DNA

<213> Artificial

<220>

<223> primer

<400> 22

ggcaatgatg tagatgaggc

<210> 23

<211> 19

<212> DNA

<213> Artificial



<220> 

<223> primer 

<400> 23 

cccaggcaac cataagcac 

<210> 24 

<211> 30 

<212> DNA 

<213> Artificial 



<220> 



<223> primer 
<400> 24 

cttttctact ggcttttgat ctttcctcgg 



<210> 25 

<211> 19 

<212> DNA 

<213> Artificial 



<220> 

<223> primer 

<400> 25 

ccttgatagg gaaaccttc

<210> 26

<211> 20

<212> DNA

<213> Artificial

<220>

<223> primer

<400> 26

caccagcata tacattagca

<210> 27 

<211> 19 

<212> DNA 

<213> Artificial 



<220> 

<223> primer 
<400> 27 

gaaggtttcc ctatcaagg

<210> 28

<211> 27

<212> DNA

<213> Artificial



<220> 

<223> primer 

<400> 28 

gtatcatgta ccagtcacag caggagg 

<210> 29 

<211> 28 

<212> DNA 

<213> Artificial 



<220>

<223> primer

<400> 29

ccaaagacca gaagtcctat gaaactgc

<210> 30

<211> 20

<212> DNA

<213> Artificial



<220> 

<223> primer 

<400> 30 

gagtggagaa gaaaagtcag 

<210> 31 

<211> 21 

<212> DNA 

<213> Artificial 



<220> 



<223> primer 

<400> 31 

cacggaacct agattcactc c 

<210> 32 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> primer 

<400> 32 

cccagagcaa gtgatttc 

<210> 33 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> primer 

<400> 33 

cgagtgcccg taggagtg 

<210> 34 

<211> 22 

<212> DNA 

<213> Artificial 

<220> 

<223> primer 

<400> 34 

ttgcacctag tttattcatc tc 



<210> 35 
<211> 23 



<212> DNA 
<213> Artificial 



<220> 

<223> primer 

<400> 35 

gtcataaatg aagtttgtta ccc 

<210> 36 

<211> 21 

<212> DNA 

<213> Artificial 

<220>

<223> primer

<400> 36

caacagttat ccagagattc a

<210> 37

<211> 19

<212> DNA

<213> Artificial



<220> 

<223> primer 

<400> 37 

gagtccctgc caatagaac 

<210> 38 

<211> 20 

<212> DNA 

<213> Artificial 



<220> 

<223> primer 



<400> 38 

gcaaatgcag tatgtgacac